THE BELL SYSTEM 





DEVOTED TO THE SCIENTIFIC & AND ENGINEERING 


ASPECTS OF ELECTRICAL COMMUNICATION 





VOLUME XLIII JULY 1964 NUMBER 4, PART 2 


SST aoe ne 





Timing Errors in a Chain of Regenerative Repeaters, ITI 
. B. K. KINARIWALA 1481 
The Maser Rate Equations and Spiking D. A. KLEINMAN 1505 


Design of Wideband Sampled-Data Filters 
R. M. GOLDEN AND J. F. KAISER 1533 


The ALPAK System for Nonnumerical Algebra on a Digital 
Computer—III: Systems of Linear Equations and a Class of 
Side Relations J. P. HYDE 1547 


A Technique for Measuring Small Optical Loss Using an Oscil- 
lating Spherical Mirror Interferometer 
A. J. RACK AND M. R. BIAZZO 1563 


On the £-Boundedness of Solutions of Nonlinear Functional 
Equations I. W. SANDBERG 1581 


A Ffrequency-Domain Condition for the Stability of Feedback 
Systems Containing a Single Time-Varying Nonlinear Element 
I. W. SANDBERG 1601 


The Resistance of an Infinite Slab with a Disk Electrode 
G. F. FOXHALL AND J. A. LEWIS 1609 


Permutation Groups, Complexes, and Rearrangeable Connecting 
Networks V. E. BENES 1619 


Optimal Rearrangeable Multistage Connecting Networks : 
V. E. BENES 1641 


Attitude Determination and Prediction of Spin-Stabilized Satellites 
L. C. THOMAS AND J. O. CAPPELLARI 1657 


Directional Control in Light-Wave Guidance S. BE. MILLER 1727 


Alternating-Gradient Focusing and Related Properties of Con- . 
ventional Convergent Lens Focusing S. E. MILLER 1741 
(Contents continued on inside back cover) 


Serer cet ees 





COPYRIGHT 1964 AMERICAN TELEPHONE AND TELEGRAPH COMPANY 


THE BELL SYSTEM TECHNICAL JOURNAL 


ADVISORY BOARD 


P, A. GORMAN, President, Western Electric Company 
J. B. FISK, President, Bell Telephone Laboratories 
J. E, DINGMAN, Executive Vice President 


American Telephone and Telegraph Company 


EDITORIAL COMMITTEE 


A. 


Cc. DICKIESON, Chairman 
. T, ANDREWS, JR. K. E. GOULD 
. J. BUSCH W. C. HITTINGER 
R. P. CROSS J. R. PIERCE 
R. L. DIETZOLD D. E. PROCKNOW 
R. W. EHRLICH E. C. READ 


EDITORIAL STAFF 


. E. SCHINDLER, JR., Editor 

M. COLE, JR., Assistant Editor 

T. mysAK, Production and Illustrations 
N. POPE, Circulation Manager 


soe F © 


THE BELL SYSTEM TECHNICAL JOURNAL is published six times a year 
by the American Telephone and Telegraph Company, E. J. McNeely, President, 
C. E. Wampler, Vice President and Secretary, J. J. Scanlon, Vice President 
and Treasurer, 195 Broadway, New York, N. Y. 10007. Subscriptions are accepted 
at $5.00 per year. Single copies $1.25 each. Foreign postage is $1.08 per year or 
18 cents per copy. Printed in U.S.A. 


THE BELL SYSTEM 
TECHNICAL JOURNAL 


VOLUME XLIII Juuy 1964 NUMBER 4, PART 2 


Copyright 1964, American Telephone and Telegraph Company 


Timing Errors in a Chain of 
Regenerative Repeaters, III 


By B. K. KINARIWALA 
(Manuscript received March 2, 1964) 


We present here a general and rigorous theory of the jitter accumulation 
in a chain of regenerative repeaters. The sources of jitter are assumed to be 
the stgnal-dependent sources, as distinguished from purely random sources 
independent of the signal. 

Our results show that while the absolute jitter and its dispersion grow with- 
out bound with the number of repeaters, the spacing and the alignment jitter 
remain bounded. In particular, the spacing jitter bounds are quite optimistic 
for most practical situations, viz., no greater than twice the absolute jitter 
injected at a single repeater. This result 1s of importance in that tt ensures 
proper decoding of the binary signal. Its further emportance is that it 
does ensure, in most cases, the validity of the basic model and thus the valedity 
of other results obtained by that model. One such result shows that the align- 
ment jitter 1s slowly-varying for repeaters farther along the chain. We also 
include some results which would be of use in computations, together with a. 
sumple example. 


I, INTRODUCTION 


1.1 Purpose 


Pulse regeneration is an attractive feature of digital communication 
systems. A regenerator or a regenerative repeater must: (2) detect the 
presence or absence of a pulse at certain time instants which are, ideally, 
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multiples of the basic pulse repetition period, (727) regenerate the pulse 
shapes, and (277) retime these pulses so they occur at appropriate in- 
stants of time in the outgoing signal. In practice, errors in detection due 
to noise, distortion, etc. in the system and errors in retiming the signal 
impose limitations on the operation of such systems. Except for noise 
which may lead to the detection of a pulse where no pulse exists in the 
original signal and vice versa, the imperfections in the system show up as 
a jittering of the pulse positions in the outgoing signal. In self-timing 
repeaters,! the jitter from all sources except random noise is also de- 
pendent on the signal itself.? 

Several workers have investigated many different aspects of the tim- 
ing jitter problem.!-® We study here the signal-dependent jitter due to 
repeated regeneration. This article leans quite heavily on our previous 
discussion, and in fact it supplements as well as complements the previ- 
ous results referred to in this article as Parts I and II.’ 


1.2 Background 


In our previous discussion, we showed how the timing jitter in a pulse 
train accumulates as the pulse train is repeatedly regenerated. The tim- 
ing information was assumed to be extracted from the incoming signal 
itself, and the timing extractor was assumed to be a tuned circuit. Mis- 
tuning in the tuned circuit was assumed, as a convenience, to be the 
major source of jitter. However, the accumulation properties of timing 
jitter are not dependent on a particular source of jitter. We shall attempt 
to clarify this point here. 

Our previous results showed that the displacements of the pulse posi- 
tions from their original positions (or the “absolute jitter’) increase in- 
definitely with the number of repeaters. The major component in the 
absolute jitter was found to be flat delay (i.e., the same displacement 
at every pulse position). A natural question follows: How does the ab- 
solute jitter behave if the flat delay is removed? This is the ‘‘dispersion”’ 
or the absolute jitter measured against the reference clock delayed by an 
appropriate amount. It was shown that the dispersion also increases 
without bound except when the pulse trains are severely constrained 
(e.g., periodic, finite, etc.). These results are valid even under the con- 
straint that there exists at least one pulse in a predetermined number of 
the basic periods or “time slots.” 

It is worth noting that the absolute jitter and the dispersion have as 
counterparts the average and the variance of the random variable repre- 
sented by the pulse displacement at any pulse position. We wish to em- 
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phasize, however, that our results are independent of any a priori sta- 
tistics concerning the pulse train ensemble. 


1.3 Results 


In the present article, our most significant result concerns a more im- 
portant parameter, viz., the ‘‘spacing jitter,’ which is the variation in the 
spacing between adjacent pulses. We show that the spacing jitter is 
bounded for an indefinitely long chain of repeaters, and the bound is di- 
rectly related to the minimum pulse density. Such bounds may be pre- 
cisely evaluated both for the infinite as well as the finite chain of re- 
peaters. The importance of these results lies in the precise evaluation of 
the bounds, the means to control these bounds, and in our ability to re- 
late these bounds either with resulting errors in the decoding of the sig- 
nal,? or with distortion in the analog signal, depending on whether the 
bounds are large or small. As will be seen later, these results also deter- 
mine the validity or otherwise of all the other previous results on the 
timing problem, since the present results have a direct bearing on the 
validity of the model used by most people. 

We also present a rather thorough discussion of the computational 
aspects of the problem in the Appendix. Special situations such as peri- 
odic patterns, truncations, pattern transitions, etc. are included in our 
discussion. 

The case of nonidentical repeaters is examined briefly. It appears that 
the bounds on jitter are not appreciably different if the repeaters are not 
appreciably different. The ‘misalignment,’ or the jitter introduced by 
a single repeater in an already jittered pulse train, is also examined 
briefly. We show that the misalignment is slowly-varying for repeaters 
further along in the chain. 

Let us emphasize, in conclusion, that our results are not dependent on 
any a priori statistics. Our analysis is quite rigorous once the basic model 
is derived. The basic model is essentially the same as that of other in- 
vestigators, and the major assumption in the model asserts that a single 
repeater introduces only slowly-varying jitter in a jitter-free pulse train. 
Such an assumption is quite reasonable for any practical repeater. 


1.4 Organization 


We start with a mathematical statement of the problem. Our formula- 
tion shows that the input and the output jitter sequences are related to 
each other by a linear operator which maps the space of bounded se- 
quences into itself. The dimensionality of the space is determined by the 
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memory of the system, which is infinite for an infinitely long chain of re- 
peaters. We are thus led to a discussion of the operator in a Banach space 
of infinite dimensions. The spectral properties of the operator determine 
the behavior of the absolute jitter and the dispersion. Next, we consider 
the spacing jitter obtained by a difference operation on the absolute 
jitter. The brief discussions on the misalignment, unequal repeaters, 
etc. follow. Finally, we present several results to facilitate computations. 


II. STATEMENT OF PROBLEM 


The basic model of the repeater is represented by a tuned circuit ex- 
cited by a train of pulses. The natural frequency of the tuned circuit is 
assumed to be very close to the pulse repetition frequency. At upward 
(or downward) zero crossings of the response of the tuned circuit, timing 
pulses are generated which determine the instants of outgoing pulses; 
the presence or absence of a particular pulse in the output signal is de- 
termined by the presence or absence of a pulse in the input signal. 

Let {--- , —27, —7,0,7, --+ } represent the instants of occurrence 
of pulses for the ideal pulse train. These would be the centers of the cor- 
responding time slots. The occurrence or nonoccurrence of a pulse at 

= —vnr is determined by the value of the random binary variable @,, . 
A pulse is present when @, = 1 and no pulse is present when @, = 0. 

At this point it is convenient to assume that the pulse train consists 
of impulses located at the actual pulse positions defined above. The 
actual pulse shapes modify the zero crossings of the response of the tuned 
circuit, and a term representing such a correction can be added sepa- 
rately. 

Finally, let &' be the displacement of the kth pulse (originally located 
at t = —kr) at the output of the /th repeater in a chain of repeaters. At 
the input of the /th repeater the timing displacement is given by &'’, 
which is the jitter valuc at the output of the (J — 1)th repeater. The dis- 
placement (or jitter) is measured in radians, where 27 corresponds to the 
pulse interval 7. 

In order to determine £;', we merely find the appropriate zero crossing 
of the response of the tuned circuit excited by a train of pulses. The ex- 
citing pulse train is itself jittered and this fact is represented by the 
values of {é'"}. It turns out that the &' is actually a nonlinear function 
of the set {é 4, } with n = 0, 1, 2, --- . Furthermore, it also depends 
on the original signal represented by the set {@,} and, of course, the Q 
of the resonant circuit. If, however, {&4' '} satisfy certain conditions, 
it is possible to represent the é,' as a linear function of the variables 
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{£4}. The function is still dependent on {@,} and the Q of the cir- 
cuit. This is the fundamental relation between the input and output 
jitter and, if there were no jitter introduced by the repeater, it would 
take the following form: 


De QntiB” Ente 
‘= aes (k =0,1,2,---;l1= 1,2,---), (1) 


2 Qn+i8” 


where 6 = exp (—7/Q) © 1 — (/Q) for large Q. 
The conditions that must be satisfied by &’” are the following: 


[& 7 — &'T|<«K2/Q forall k. | (2) 


These conditions are unaltered when (1) is slightly modified to include 
the jitter introduced by each repeater [cf. (8)]. It is therefore very im- 
portant to make sure that (2) holds in order for any of the results ob- 
tained on the basis of (1) to be valid. The quantity required to be small 
in (2) is the spacing jitter, whose behavior is important in that if it ever 
becomes too large there will occur errors and distortion in the decoded 
signal.?-4 It is our intention here to investigate thoroughly the behavior 
of the spacing jitter. Equation (1) represents the way in which jitter 
propagates along a chain of repeaters. The jitter accumulation properties 
of a chain are, therefore, a consequence of this basic equation. Of course, 
at every repeater there is jitter injected in addition to the one propagated 
from previous repeaters. Since we are not discussing here effects of ran- 
dom noise, the sources of the injected jitter are signal-dependent, and 
they are identical if we assume identical repeaters. The jitter injected at 
every repeater by signal-dependent sources is thus identical. For such 
sources, this injection of jitter is simply additive either at the input end 
of the repeater? or at the output end.?3:%7 For example, dispersion in the 
channel with a consequent imperfect detection of the pulse positions 
would be an additive source of jitter at the input of the repeater.® Cer- 
tain nonlinear operations (such as limiting) on the response of the tuned 
circuit would inevitably alter the zero crossings, and this may be repre- 
sented as an additive source at the output end of the repeater. If the 
mistuning of the resonant circuit is small, it can be shown that mistuning 
represents an additive source at the output end. Finite pulse widths also 
represent an additive source at the output.” In any case, such injection 
of jitter depends on the signal and repeater parameters. Since these are 
the same at every repeater, the injected jitter is the same at every re- 
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peater. Additivity of these sources is usually a consequence of the fact 
that the injected jitter is small. For convenience, we will refer all these 
sources to the output end and represent the injected jitter by {&'}, the 
output of the first repeater, where the original pulse train is assumed to 
be jitter-free. It is somewhat inconvenient to refer all sources to the in- 
put, since it would involve inverting the functional relationship of (1). 
Our interest in this article is not to discuss the quantitative evaluation of 
{&}. For simple sources such as mistuning and finite pulse widths, it is 
relatively easy to evaluate {£,!}. We shall discuss elsewhere the question 
of determining {1} when all sources are included. However, let us 
emphasize that we do not restrict the sources of jitter to short memory 
mechanisms.° 

In this paper, our interest is to determine the behavior of {&’} for 
large / when the basic equation (1) is modified to include the injected 
jitter: 


D3 Crab fee 
& = “2 _______ + &, (3) 
Qn+i8” 


Mes 


n=0 


subject to condition (2), and where we assume that &° = 0. Condi- 
tion (2) is satisfied by {&°} since they are all zero and by {é,'} because 
for a practical repeater the jitter injected by a single repeater must be 
extremely small. We have made no assumptions on the nature of the 
signal. Thus, if it can be shown that condition (2) is valid for every 
1, then the results obtained by using (3) are valid. This is an important 
point which cannot be overemphasized. We will therefore pay particu- 
lar attention to the behavior of the spacing jitter. 


III. RECAPITULATION 


In this section, we recapitulate the basic formulation of the problem 
developed in Parts I and II of this article. For details, the reader is re- 
ferred to the original discussion. Define a vector 


X= { £0", f', fe, +t, (J _ 1, 2, saa) (4) 
and Xo = 0. Then, the basic equation (3) may be written as 


X = TX 1-1 + xX , (5) 
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where 
@ @8 @B | 
So So So 
gp GG GB, 
hr=| % & (6) 
j. Yo Oe ke 
S89 | 
and 
si = 2, GiB". (7) 


We have thus expressed our original problem in terms of an operator 7'p 
which maps the Banach space 1, (the space of bounded sequences) into 
itself. We are interested in the behavior of X, as / approaches infinity. 
The operator Ty is the most general one. However, there is some interest 
in the behavior of the jitter when the pulse trains are periodic and the 
operator 7’) under steady-state periodic condition (cf. Part 1) becomes 





@ GB | Gab" 
So! So’ So’ 
Go" Gr Gna 
Ay: = Sy’ sy sy’ ; (8) 
OS he. ge “2 
Sa 4" Sm—1 


where s,/ = (1 — 6”)s, for all n. Finally, it is more convenient to write 
the operator T> explicitly to indicate only those positions where the 
pulses are present. This leads us to the operator 


1 gB” gre A | 
So So So 
T = 1 Bg a) mis , (9) 





Si St | 
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where 
So = So (10) 


and 
S,a=1+pS,. (11) 


The vectors X; are also assumed to be suitably modified. We will use 
the operator T of (9) to represent our quantities of interest.* The operator 
T) will give identical results. The special restriction to the periodic case 
will be of interest when we discuss computational aspects. The operator 
Ao is also assumed suitably modified to A. Several parameters of interest 
may now be expressed in terms of the operator 7 and the injected jitter 
element X;. 


3.1 Absolute Jitter 


Trom (5), we have 


i-1 
Xi= bP "| Xi (12) 
v=0 
and in the limit 


Y = lim b r |x. (13) 


I+0 Lyv=0 


3.2 Dispersion 


This is obtained by subtracting the average delay from the absolute 
jitter. A delay element is represented to within a constant by {1,1,1, ---}. 
This element happens to be an eigenvector of 7’ corresponding to an 
eigenvalue at \ = 1. Under the condition that \ = 1 isa pole (cf. Part 


II) of 7, we can represent the dispersion by 


I-1 
= (I — &)X, = E — B,| > | Xi, (14) 
v=0 
where £; is the projection operator FE; (\ = 1; T) which takes on the 
value “one” in the neighborhood of \ = 1 and zero elsewhere. 


* It should be observed that the condition (2) is also modified. The right-hand 
side of (2) now becomes z/Q times the number of basic periods r between two ad- 
jacent pulses. 
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3.3 Spacing Jitter — 


This is obtained by subtracting from the absolute jitter at one pulse 
position the absolute jitter at the adjacent pulse position (not neces- 
sarily the adjacent time slot). 


i-1 
xt=[1-s]or]x, (15) 
v=0 
where S is the shift operator given by 


i 1 0 O 
0 O 1 Qi) “ate 
°= | 9 0 0 is eae (16) 


3.4 Alignment Jitter 


This quantity is given by the difference of Xy4; and X; : 
Xig* = Xin — X= T'R. (17) 


Finally, we summarize here without proof the properties of the oper- 
ator T which were determined in Part II of this article. 

(a) The operator 7 is a bounded operator mapping 1, into itself for 
1 < ps o. In particular, the norm of T in 1,, is equal to one (i.e., 
[Fp = 1): 

(b) The spectrum of T is a subset of the unit disk (ie., | «(7') | S 1), 
and any pole d of 7 with | | = 1 has order one. 

(c) All points in the unit circle except \ = 1 are in p(T’), the re- 
solvent of T’. The point \ = 1 is an eigenvalue of 7 with the eigenvector 
{1, 1, ---}. The dimension of the eigenmanifold is one in this case. 

(d) The point \ = 1 is the limit point of the point spectrum of 7 if 
the domain of T' is unrestricted. It is a pole of T for very special cases, 
such as periodic pulse trains, truncated pulse trains, ete. 


IV. ABSOLUTE JITTER AND DISPERSION 


The results for absolute jitter follow immediately from (13) 
I-1 
y= tim| 52" | x. 
l+o Lv=0 


Observe that \ = 1 is an eigenvalue of 7’, and since in general X, is any 
element of 1,,, the limit in the above equation approaches infinity as / 
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approaches infinity. In particular, the norm of the operator 

K, = |= r"| (18) 
is 1. We thus observe that, in general, the absolute jitter grows linearly 


with the number of repeaters. 
If \ = 1 isa pole of T, the dispersion is given by 


l 
Xia? = [ io Alp r°| Xi 3 
I 0 


1 

2 | = rs | Xi (since 4,7 = TE), 
l 

yo] 


| 


where D = TE»y and Ep = (I — [). Here l’pX;, is the dispersion ele- 


BX, | (cf. Part IT); (19) 


ment due to the first repeater. The norm of the operator 
l 
King = P P| (20) 
v=0 


is bounded and converges to a finite value as / approaches infinity. This 
follows from the previous discussion (see Part II) where it was shown 
that 


| D” | < May”, (21) 


where M is a positive constant and ay < 1. We therefore find that the 
dispersion is bounded provided that \ = 1 isa pole of 7’. This is true for 
certain highly constrained situations. In particular, this is true when 
the domain of T is restricted to a finite dimensional subspace of 1,, which 
is invariant under 7’. Examples of such cases occur when the pulse trains 
are periodic, finite, ete. 

On the other hand, when } = 1 is not a pole of 7 the projection LE; 
does not commute with 7’. In this case, 


x? =[r-n][er]x 2) 


where 


E\T # TE. (23) 
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Furthermore, it can be shown that 
|(2 — £,)T"| = 1 for all m. (24) 


This is a consequence of the spectral properties of 7’ summarized in the 
previous section, viz., the point \ = 1 isa limit point of the point spec- 
trum of 7. In fact, we show in Part II of this paper that there exist 
elements in 1,, such that the norm of the operator 


Ki? = E — B,| [> | (25) 


is (1 + 1). It follows, therefore, that the dispersion grows without bound 
in the case of purely unconstrained pulse trains. The result is not altered 
even if some form of coding is provided to eliminate indefinitely long 
strings of zeros in the pulse trains. 


V. SPACING JITTER 


At the output of the /th repeater, the spacing jitter is given by (15), 
l-1 
Xv = E — s| bs r°| X. = [Kr]X.. (26) 


In particular, we are interested in knowing whether the quantity X/ 
remains bounded as / approaches infinity. Secondly, if it remains bounded 
we wish to determine the least upper bound for each J. Since there are 
no restrictions on X, (other than the requirement of boundedness), we 
are interested in determining the norm in the limit of the operator K,’; 
or, 


lim | K;'| = lim | (2 — 8)Kz| (27) 
1-00 I+00 


when we know that 


lim | Ki | @. (28) 
For physical systems we are also interested in | K,’ | for all U. 

All of our results in this section depend upon an important lemma con- 
cerning the operator K;’. We assert that K,’ has a representation simpler 
than the one given in (26) and prove this assertion by verification. 
Define an operator 


B = diag-{(Sp — 1)™, (8S; — 1)™, (&: — ee ae (29) 
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where S, ~* 1 are defined in (11).* Then we assert the following 
Lemma: 


l—1 
Ki = [ = s| > "| = BU -— TT". (30) 
v=0 
Proof: Observe that in (30) 


BI -T)T" = BUI -— T)T" [> | 


7=0 
So, we need merely show that 
i — S|} = B(T — 1). 
Or, 
Bt — B'S =T" —- IT, 
where 
B™ = diag-{(So — 1), (Si; — 1), ---} 
= diag-{So, S:, Se, ---} — J. 
Thus we need to show that 
diag-{So, Si, S2:,-::} —- BUS = T”. 
But from (11), 
(Sai = 1) SB Si 
so 
B™ = diag-{6"'S,, B°S2, +++}. 


Therefore, the lemma is proven if 


i —B"S8, 0 0 0 

0 S —B"S 0 QO «-:- 

Ps : . | (31) 
Po 0 So —B° Ss; 0 sae 


The truth of the above statement is verified directly by considering the 
products 7 'T = TT™ = I. The validity of (30) is thus proven. 


* It is quite possible that the S, approach unity. If any S, = 1, it implies a 
finite pulse train, and the question is analyzed very simply in a finite dimensional 
space as was done for the periodic case. Such cases, however, do not give us infor- 
mation for the infinite pulse trains which are required for long chains of repeaters. 
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We can now state the following 
Theorem: The operator K7' ts bounded for all Lif inf 8, 2 a> 1. 
Proof: From (30), we have 


[Ki | =|BU— TT" | 


IIA 


Se meee es Be aaa 
We know that | 7" | = 1 for all land | 77' | is finite from (31). Hence 
| K,° | is bounded if B is bounded, which it is if inf S, = a > 1. The 


theorem is thus proven. 


Corollary: 
lim | Kv | < . 
I-+00 


This follows trivially since the bcundedness of K,’ was proven inde- 
pendently of /. Our assumption that inf S, 2 a > 1 isa simple assertion 


of the fact that indefinitely long strings of zeros are ruled out on any 
communication channel. 
Next, we wish to determine what precisely is the norm of the element 


X; = K/X, 


as it relates to the norm of X,. Let us recall that X, is the jitter in- 
jected (at a single repeater) referred to the output of the repeater. The 
injected jitter referred to the input of the repeater is 


XS KG, (32) 


where 7'— is defined in (31). We find it more convenient to work with X 
in what follows. It is obvious, of course, that our entire discussion could 
have been carried out in terms of X from the very start. We chose not 
to do so in order to avoid a premature discussion of 7'-!. The sequence 
represented by X naturally satisfies condition (2). In fact, the use 
of X allows a much simpler comparison of the spacing jitter at different 
repeaters along the chain. We are interested in the behavior of 


XxX; = [Kr T]X (33) 
for each /. This is obtained by a precise evaluation of the norm of 


Ri = K*T = BU — T'). (34) 
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Let us first define Sins = inf S, . Then, we state the following 


Theorem: The norm of Ry is given by 








iy (35) 


Proof: 
Rf = BI — T’). 


Let us consider the representation of (I — T’). The diagonal elements of 
(I — T') are of the form [1 — (1/S,’)], whereas the off-diagonal cle- 
ments are all negative. Also, the sum of the elements in each row must 
be zero. If we multiply (J — 7’) on the left by B, then the diagonal 
elements are of the form [1/(S, — 1)][1 — (1/S,’)]. Again the off- 
diagonal elements are all negative and the sum of the elements in cach 
row of B(I — T') is zero. Hence, in 1,, 


ES 1 eee 
pal r)|=2(,44) (1 a) 


2 i—1 
Sine 2, Sin? 

The theorem is thus proven. Observe that this is not just a bound but a 
precise norm. This value in the norm is taken by the spacing jitter X,’ 
for some X whose norm is one. In other words, the value in (35) represents 
the maximum magnification of X that is possible to yield the value of 
the spacing jitter. It is interesting to note that this worst case occurs for 
each / for the same element X, viz., {--- , 1, —1, —1, ---}, where the 
-+-1 corresponds to the position of 1/Sin¢ in the matrix representing the 
operator 7’. Finally, to observe the maximum possible growth of the 
spacing jitter as it compares with the maximum possible spacing jitter 
at a single repeater, we compare the results for! = land] = o~. 











gas 

fi Sint 26) 
Suet 

Re = 3. (37) 


The ratio of the quantities R,,° to Ry’ is less than two if Sinz is at least 
greater than two, which is to be expected in most physical situations. 
For example, if Q is of the order of 100 and there is at least one pulse 
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present in fifteen time slots, then Sin; is at least two or greater. Thus, we 
observe that not only is there a bound on the growth of the spacing 
jitter but that the growth is monotonic and levels off rather fast. Of 
course, this is a rather general result which, if desired, can be more spe- 
cifically stated in terms of the a priori probability distributions of the 
binary signal. Furthermore, the importance of this result lies in specify- 
ing the conditions for the validity of our model. Under most realistic 
situations it should be clear that conditions (2) are met at every re- 
peater. Of course, there are situations when these conditions are not met 
and the results obtained by the use of our model [cf. (8)] can no more 
be relied upon. However, we show that under most situations the results 
are reliable and very optimistic, as shown by (86) and (37). 

The above results are the most crucial ones in this paper. The rest of 
the paper is devoted to a varied miscellany that has some bearing on dif- 
ferent aspects of the timing problem. 


VI. ALIGNMENT JITTER 


The alignment jitter in the (J + 1)th repeater is given by (17), 
xX mae — rx 1. 


Obviously, for all / the alignment jitter is no greater in the norm than 
the absolute jitter X. This follows trivially since | T’ | does not excced 
one for any J. 

Tor more detailed insight into the behavior of the alignment jitter, 
we need to discuss specific situations. 

(a) If\ = 1 isa pole of T, then 7" converges to T” and X74,° settles 
down to a flat delay element for large 1. 

(b) If\ = 1 is nota pole of T, and inf S, = 1, 7’ does not converge 


to T”. However, for large / the alignment jitter is slowly-varying. 
(c) If \ = 1 is not a pole of 7, and inf S, = a > 1, the alignment 


jitter (for large 1) varies even more slowly than it does in (b). 

All the above results follow from the properties of T’. If\ = 1 isa pole 
of T, the result is obvious (cf. Part I). If it is not a pole of 7, the results 
follow from the fine structure of the spectrum of T. For example, in 
situation (b), all points in the point spectrum except \ = 1 are poles of 
T, whereas this is not true in situation (c). The corresponding eigen- 
vectors have different structures for the two cases (cf. Part II). 

It follows that the situation of (a) is to be preferred over that of (c), 
which in turn is preferable to that of (b). 
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VII. NONIDENTICAL REPEATERS 


This is a rather difficult matter to discuss with any great generality. 
What we hope to do here is to briefly indicate perhaps the most conveni- 
ent formulation and to give some reasons for believing that the orders 
of magnitude of the jitter parameters are not changed for small differ- 
ences in the repeaters. 

There are essentially three possible ways in which the repeaters may 
differ: (z) injected jitter, (27) Q of the repeater, and (777) mistuning. These 
differences appear mathematically in terms of different operators, mul- 
tiplicative coefficients in the power series, and so on. We examine each 
of these separately. 

If we assume that the injected jitter differs slightly at each repeater, 
then we may write (5) as 


XxX] = TXi-4 + May + Al, (38) 


where Xa, is the average injected jitter and A; represents the deviation 
from this average in the /th repeater. The norm of X, differs at most 
from the previous case of identical repeaters by 


l 
[44+ TAa+ PAe+-:- +77 | S d | Ar]. (39) 


If the | A;| are quite small, it is clear that the results will not be ap- 
preciably different from the previous ones. 

If the Q’s are different, then our basic operator is different for each 
repeater. It would be almost impossible to analyze such a case in general. 
However, we can make certain observations if we put 


1=1Tf+ Ki, (40) 


where T; is the operator representing the /th repeater, which is assumed 
to be an operator 7 perturbed by an operator K,. It is reasonable to 
expect | K;| « 1. Then (5) becomes 

X,= TXi1+ X14+ KX. (41) 


If | K,| S$ ¢ <1, then the norm of X; does not differ from the previous 
results by more than 





(5) ele Xi (42) 


Again, we see that the results are not appreciably different. 
In the case of mistuning, (5) takes essentially the same form as (38), 


X1 = TX14+ X4+ eX B, (43) 
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where X, is the injected jitter due to sources other than mistuning, e; 
is the mistuning in the /th repeater, and ¢)X z is the jitter due to mistun- 
ing (ef. Part I). Then 


i—1 I-1 
X1 = b r°| Xat P aa" | Xp. (44) 
v=0 v=0 


The contribution due to X,4 is unaltered and the contribution due to 
X , would depend on the specifications of «;. However, if we assume that 
the magnitudes of e; do not exceed one, then the contribution in the 
norm due to Xz cannot exceed / times the norm of X,. Thus the worst 
case for the absolute jitter does indeed arise from the assumption of 
equal ¢; at their maximum values. 

For spacing jitter we can make a slightly different statement for the 
contribution due to X,. For one repeater, the worst case (in the norm 
sense) occurs for the maximum value of e = 1, then the worst case for 
two repeaters is obtained by setting e = 1. Setting the first two re- 
peaters with e« = « = 1, the worst case for a string of three repeaters 
is obtained by setting «; = 1 and so on. The statement is a simple con- 
sequence of the inequality 


[eer Si G=2) |. (45) 


It is believed that a similar statement can be made for the alignment 
jitter. 

We thus observe that, when the differences in the repeaters are small, 
it is reasonable to expect that the results are not appreciably different 
from those obtained by assuming identical repeaters. 


VIIT. CONCLUSION 


We have presented a general and rigorous theory of the jitter accumu- 
lation in a chain of regenerative repeaters. The sources of Jitter are as- 
sumed to be the signal-dependent sources, as distinguished from purely 
random sources independent of the signal. 

Our results show that while the absolute jitter and its dispersion grow 
without bound with the number of repeaters, the spacing and the align- 
ment jitter remain bounded. In particular, the spacing jitter bounds are 
quite optimistic for most practical situations, viz., no greater than twice 
the absolute jitter injected at a single repeater. This result is of impor- 
tance in that it ensures proper decoding of the binary signal. Its further 
importance lies in the fact that it does ensure, in most cases, the validity 
of the basic model and thus the validity of other results obtained by that 
model. One such result shows that the alignment jitter is slowly-varying 
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for repeaters further along the chain. Finally, a brief discussion shows 
that the assumption of identical repeaters leads to results which are of 
the same order of magnitude as would be obtained if the repeaters dif- 
fered by not too great an amount. Some results which would be of use in 
computations are to be found in the Appendix, together with an example. 

In our discussion so far, we have investigated the accumulation prop- 
erties of jitter due to repeated regeneration. We have made no attempt 
to determine the jitter introduced by a single repeater. Analytically, this 
problem is complicated not only by the nonlinearities involved, but also 
by a lack of complete knowledge as to the actual mechanisms involved. 
We propose instead, in a later paper, an experimental approach which 
allows these measurements to be carried out under steady-state condi- 
tions. This experiment also has some bearing on the question of simula- 
tion of long chains of repeaters. 
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APPENDIX 


Kigenvalues and Eigenvectors of T 


Practical systems call for the evaluation of jitter when the number of 
repeaters is finite. In such cases, we need not concern ourselves with in- 
finite pulse trains. So long as the pulse trains are much longer than the 
effective memory of the system, the results obtained by considering 
finite pulse trains will be reliable. The results will also be reliable if the 
pulse trains are considered periodic with the period being greater than 
the memory of the system. Actually, as we shall see, the periodic pulse 
trains are much more difficult to work with than the finite ones. However, 
we shall discuss the periodic case in detail since much of the experimental 
work is carried out using periodic pulse trains. Finally, another case of 
interest is that in which there is a certain quiescent pattern which changes 
to a different one. This would include a periodic pattern changing to 
either a nonperiodic or a different periodic pattern. In each case, our 
interest is in determining the set of eigenvectors. Computations can then 
be carried out by expressing the injected jitter element in terms of the 
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eigenvectors (cf. Part I). If the set of eigenvectors is not complete, we 
employ the standard procedure and use the so-called generalized eigen- 
vectors. It should also be observed that the case of nonidentical repeaters 
is also handled using the same techniques. We present here certain sim- 
ple algorithms for determining the eigenvalues and the eigenvectors for 
the several cases of interest. 


A.1l Truncated Pulse Trains 


This is the case where the pulse train is finite. The matrix T in (9) is 
now finite, upper triangular and the diagonal element in the last row is 
unity. Such a matrix also arises in the case of pattern transitions where 
originally the quiescent pattern is periodic with only one pulse present 
in each period. This is also the more realistic case because the tuned 
circuit is thus properly excited. We consider the truncated case, therefore, 
together with the pattern transition case. 


A.2 Pattern Transitions 


Here we have a steady-state periodic pattern which changes to a dif- 
ferent pattern. The operator 7' can be represented as 


Be 
T= 
p | 


where P, representing the new pattern, is an upper triangular square 
matrix, A represents the quiescent periodic pattern, 0 is the null matrix, 
and C' is the connecting matrix. The matrix A is either an arbitrary 
positive stochastic matrix for an arbitrary periodic pattern or it is a 
scalar (viz., unity) for the case of only one pulse present in each period. 
The latter case also occurs when the tuned circuit is excited by a refer- 
ence pulse train such as 101010 --- . 

In either case, the eigenvalues of T are given by the eigenvalues of 
P and those of A. The eigenvalues of the matrix A are discussed in the 
section dealing with the periodic case. The eigenvalues of P are given by 
the diagonal elements S,, ’. If A is a scalar, the only other eigenvalue is 
unity. All the eigenvalues are thus determined. 

Next, we observe that the eigenvalues of P are distinct. If not, for 
some n (say, n = 0), 


So=1+ p+ prt? +... + [BE ar M]So, 


which implies a steady-state periodic pulse train, contradicting the 
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transient nature of P. Thus the eigenvalues of 7 are distinct unless, of 
course, there is a periodic pattern involved. Let us reserve the periodic 
case for the next section. Then, the eigenvectors of 7 form a basis for 
the space of jitter vectors both for the truncated case and the pattern 
transition case when the quiescent pattern has only one pulse present 
in each period. 
The eigenvectors are given by the algorithm 
So 
” d 
End — St Le 


(cf. Part II) for each eigenvalue 2X. 

The eigenvalues and the corresponding eigenvectors are thus very 
simply determined when either the reference pattern has only one pulse 
in each period or the pulse train is finite. 


A.3 Periodic Patterns 


Let us start by assuming that the period is m and there are n pulses - 
in a period. Then 


M=y+tete- +4. 
Let a = (1 — 6”), and D; = S,a. Then, 














l Bo gitis gate tint 
D D D | De 
gitetin 1 B” piztetina1 
“Dh Dd oD D 
A= 5) 
alae ie l 
Dri Dra Dr 
A = det(Al — A) 
ADo = 1 =p ake pis 
a eee 
DyDy +++ Dra 


SE gh ae ND aed 


TIMING ERRORS 1501 











1 
PD Daya 
(AD) — a)  —B™ Did 0 Gr. tas 0 | 
0 (AD, — ao) 2 Dex 0 cay 0 
—p™ — pints Aw oes OD 4 a2 1) 
= | -7. ce ae = ( ae x (-4 ny) ST es 
Do Do D, 
DED 6-2 GEA) 
( Dpto D, pe De 
af, — 2)... (, _ 2 _ i 
: (. #) (a Bs) (a | 
= P(r) + QQ), 
where 
n—l 
= = Op 
TONS py (a #) 
and 


n=) 


coe ee Ser (. 2 a). 


p= 





Finally, after some manipulation it can be shown that 


A = (*21) + +P). 
ao Qo 
Thus the eigenvalues of A are given by the zeros of the polynomial 


R(A) = PO) — Br’, 
when 
n—1 
PA) = [Ta-s-). 
t=0 
The zeros of R(X) can be obtained from those of P(X) by root-loci con- 


siderations or other numerical methods. Since 6” is usually small, this 
can be handled easily on a digital computer. 
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Next, let us discuss the eigenvectors corresponding to the eigenvalues 
given by the zeros of R(\). We show that for each distinct eigenvalue of 
A, the eigenvector is given by the algorithm 

sre 


= r 
fey. = ood Ex. 


This would be true if & 4, = & for all k or, if 


(+) (2-J (5-9 


Cae ee 


The above is true if 


Sn—1Sn—2 aan So (+) P(A) = II (Spa = 1) 
k=1 
= [I (6**s;) 
k=1 
aes BSS. Save Sn 


= B”SoSiS.-++ Saa (since S, = Sp), 
or if 
P(A) a B™r", 


which is indeed true for every eigenvalue \. For a repeated eigenvalue 
A, if it exists, one must find a generalized eigenvector in the usual way. 
We have thus given simple algorithms for determining the eigenvalues 
and the corresponding eigenvectors of A. 
It should be mentioned that the algorithm for eigenvectors is the same 
as the one given above when A is a submatrix of 7 as in Section A.2. 


A.4 Example 


Let us consider a simple example to illustrate some of the points. Con- 
sider a system with repeaters having Q = 100 and signals having at least 
one pulse present in 10 time slots. Consider the case of a quiescent pat- 
tern (101010 --- 10) suddenly changing to a new periodic pattern with 
one pulse in 10 time slots. We are interested in determining the behavior 
of the first pulse after the transition. 
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The operator 7 has the form 


1 B 
Feces 1+ 81+ 62+ 6+ +++) 


0 eee ee a 
oe +e + a4.) C) 
0 1 

0 eee ee 

(1+ 6°-+A'+ +++) 


Let the jitter vector X; = {f, &, &, &, ---}, where & is the reference 
phase jitter for the quiescent pattern and (& — &,) is the change due to 
the transition. 

In our simple example it is clear that there are only two eigenvalues 
of T, viz., 0 = (1 — 6’)/(1 — 6 + 8B”) and), = 1. The eigenvectors 
corresponding to these eigenvalues are e7 = {(f — &), 0,0, ---} and 
= {1,8 , 1, +++}. 

The absolute jitter obviously increases without bound with the num- 
ber of repeaters. The dispersion remains bounded, since \; = 1 isa pole 
of T in our simple case. In the limit, the dispersion is given by 


1 i Sr 


The spacing jitter in the limit is given by 


1— 6+ 6" 
(EFF 
in the zeroth position and zero elsewhere. Finally, the alignment jitter 
approaches e; in the limit. 
The validity of the results is assured if 
1 Pre 2 + 10 
Cate (f — &) K 10 


iS 
Q ? 
or if 
7 ge Pr 5 
8) «105 (Fa) i: 
Thus, if the jump in the phase jitter, for a single repeater, due to the 


transition in pattern is much smaller than 18°, our results are valid. This 
requirement can be expected to be satisfied by most practical repeaters.° 
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The Maser Rate Equations and Spiking 


By D. A. KLEINMAN 
(Manuscript received March 6, 1964) 


The rate equations of Statz and De Mars giving the time development of 
the inversion and photon number in a maser or laser are discussed ana- 
lytically with the aid of a mechanical analogy in which a particle moves in 
a potential well under the influence of a viscous damping force. The coor- 
dinate of this particle is analogous to the logarithm of the light output of the 
laser, and the amplitude, period, and damping of the motion can be directly 
related to the parameters of the rate equations. Simple analytic approxima- 
tions are developed for all of the quantities of experimental interest in the 
spiking pattern of a laser. Four relationships are given, which do not con- 
tain any of the rate equation parameters, whereby a spike pattern can be 
tested to determine if it 1s consistent with the usual rate equations. Systematic 
procedures are described for extracting all of the information contained in 
spike patterns. 


I. INTRODUCTION 


The most fruitful approach for the discussion of maser and laser! 
behavior has been through rate equations describing the time rates of 
change of the atomic populations and the photon numbers of the electro- 
magnetic field. Bloembergen? introduced rate equations for the popula- 
tions in a paramagnetic maser and based his discussion on the stead y- 
state solutions without explicitly considering the photon field. On the 
other hand, Shimoda, Takahasi, and Townes? have considered the pho- 
ton rate equations and on this basis have given a theory of maser ampli- 
fication without explicitly considering the atomic populations. Statz 
and De Mars‘ have shown that the transient behavior of masers depends 
upon coupled rate equations for both the populations and the photons. 
A number of authors have rederived these equations and discussed their 
applications to various maser systems. Considerable attention has been 
given to the question of whether these equations have periodic (un- 
damped) solutions. It has been shown by Makhov’ and by Sinnett® that 
the small-signal solutions are always damped, and it has been pre- 
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sumed and often confirmed by numerical computations that the same is 
true in the large-signal domain. Statz, et al.” have suggested that the 
damping of experimentally observed “‘spikes” in the output of lasers is 
probably sensitive to coherence and noise conditions, and not neces- 
sarily related to the damping predicted by the usual rate equations. 
They also suggest that the complicated spiking patterns frequently 
observed? are due to oscillation in many modes of the laser cavity. De- 
spite these doubts which have been cast on the adequacy of the Statz- 
De Mars rate equations for describing laser behavior, they still remain 
the logical starting point for any discussion of the power output of lasers, 
whether it be transient or steady-state, and of the dependence of power 
on the quality of the cavity, the intensity of the pumping light, or the 
linewidth, relaxation time, and concentration of the active atoms. 

Several ingenious suggestions have been made for modifying the rate 
equations so as to obtain periodic solutions. Statz and De Mars‘ and 
Makhov' propose that periodic solutions are obtained if terms are added 
to the rate equations representing cross relaxation in the inhomogene- 
ously broadened maser transition. On the other hand, Shimoda? sug- 
gests that periodic spiking can result if the losses due to absorption in the 
cavity can be partially saturated by the buildup of laser oscillations. Al- 
though these and other modifications may ultimately prove to be justi- 
fied and necessary in laser theory, we shall confine our attention in this 
paper to the original Statz-De Mars equations which relate the photons 
in a single cavity mode to a single quantity, the inversion, describing the 
atomic populations. We shall make it our task to understand as fully as 
possible the damped oscillatory solutions of these equations and how they 
may be applied to the study of the spiking phenomenon seen! in solid- 
state lasers. 

Rate equations in the simple form have been successfully applied by 
McClung and Hellwarth!® and by Vuylsteke" to the giant-pulse laser, 
which produces a single very short and very intense burst of radiation. 
Wagner and Lengyel!? have shown that an exact analytic solution can be 
obtained to a simplified rate equation which neglects spontaneous emis- 
sion and pumping during the pulse. By also neglecting the loss of pho- 
tons in the cavity Dunsmuir® has obtained a still simpler analytic solu- 
tion which is applicable to the rising portion of a spike or a giant pulse. 
We shall not consider further these exact solutions or the giant-pulse 
laser in this paper, but confine ourselves to those solutions representing 
repetitive pulsations, or spikes, in the ordinary laser. It is in this field 
that the greatest need now exists for an analytical discussion of the solu- 
tions of the rate equations. 
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A number of authors have endeavored to put the rate equations on a 
firmer theoretical basis. Following Anderson" and Clogston,!® who first 
described masers in terms of the density matrix, treatments using the 
density matrix to derive rate equations have been given by Fain, et al.,16 
Kaplan and Zier,” and Pao.'* By considering directly, without explicit 
use of the density matrix, the correlation functions of the electromag- 
netic field which are measured in simple maser experiments, McCumber!® 
has shown that the maser medium acts like a dielectric of negative con- 
ductivity; he concludes that the field and the dielectric satisfy rate equa- 
tions of the usual form, providing that the populations change by a small 
fractional amount during a coherence time (reciprocal linewidth) of the 
atomic system. Another formal theory, based on a successive approxima- 
tion approach to the quantum mechanical equations of motion, has been 
applied by Haken and Sauermann” to the frequency shifts and interac- 
tions of cavity modes in the laser. An extensive survey with bibliography 
of the early formal work has been given by Lamb.” In the present paper 
we shall not go into the theoretical basis for the rate equations, but con- 
fine ourselves entirely to the problem of solving the equations in the 
large-signal domain. 

Despite the fact that the rate equations are generally accepted, and 
that the general nature of the solutions has been familiar from digital 
computer calculations for some time,>"*-17:” it is still quite inconvenient 
in any particular case to compare observed spiking patterns in solid- 
state lasers with predictions of the rate equations. It has been neces- 
sary either to use the small-signal solutions and hope that they are not 
too inaccurate in the large-signal domain, or to resort to machine caleu- 
lations and try to fit three or more parameters to the data by trial and 
error. To be sure, much of the data on spiking is not amenable to analysis, 
consisting apparently of random spikes with widely varying amplitude, 
duration, and interval. Nevertheless, several laser systems are now known 
to give very regular pulsations of the type that might be consistent with 
the rate equations. Regular spiking patterns have been observed in 
CaF,:Ut by Sorokin and Stevenson® and by O’Connor and Bostick,”4 
and in CaWO,:Na*’ by Johnson and Nassau.2* Recently the effect has 
also been seen in a highly perfect ruby by Nelson and Remeika,?* and in 
a confocal ruby by Johnson, et al.?”7 More extensive studies of highly 
regular spiking have been reported by Gtirs® and by Hercher.*® Thus it 
is clear that good data on spiking patterns can be obtained, and it 
therefore becomes cogent to inquire into practical and convenient means 
for analyzing this data and obtaining information from it. 

The information contained in spike patterns is of two distinct kinds, 
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which we may call qualitative and quantitative. Qualitatively, we can 
determine quickly by means of relationships given here whether a spike 
pattern is consistent with the rate equations. If we determine that cer- 
tain patterns are not consistent, we are spared the waste of time that 
would result from attempting to fit these patterns numerically by trial 
and error. It is not obvious at this writing that any of the regular pat- 
terns that have been reported are consistent, because the application of 
the consistency relationships requires a measurement of the ratio of the 
peaks to the valleys in the light output, and only the peaks are seen in 
pictures published so far. Thus we suggest that by extending spike studies 
to include the valleys new and interesting information can be obtained. 
It is to be expected that patterns which are regular but yet not consist- 
ent will turn out to be physically the most interesting of all, since they 
will point the way to new understanding of laser behavior. The quanti- 
tative information can only be obtained from patterns which are con- 
sistent, at least in some average sense, and consists in obtaining values 
for the physical constants which appear in the rate equations: 

N = the number of active laser atoms in the optical cavity 

t, = the relaxation time of the upper laser level, usually due to spon- 
taneous emission 

tp = the photon lifetime in the laser mode of the cavity 

tm = the mode time, the time for spontaneous emission into the laser 
mode 

t, = the ground state time, the time spent by an atom in the ground 
state before being excited by the pump 

s = the source strength, the rate of production of laser photons by 
spontaneous emission, the pumping light, or any other noise source in 
the cavity, or any signal applied to the cavity. Although s may vary 
with time, it is convenient here to consider it with the constants. 

In principle all of these quantities except s could be directly measured 
or calculated from independent measurements on the laser material, the 
cavity, and the pump. The spiking data would then serve as confirmatory 
evidence. In many cases, however, spike patterns may prove to be the 
most convenient method of measurement. The last quantity, s, is in 
some respects the most interesting. The first assumption would be that 
s is due entirely to spontaneous emission into the laser mode; if so, s 
could be calculated and comparison with the measured value would re- 
veal the verity of the assumption. Experiments could be carried out with 
an external weak signal from a monochromator to test the response of 
the laser as observed in its spiking patterns. Thus we hope that the 
analysis given here will help to stimulate new experiments by making 
rate equation analysis more convenient for the experimentalist. 
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II. FORMULATION OF THE RATE EQUATIONS 


The laser is a system consisting essentially of an optical cavity with 
very high Q in a few modes, low Q in all other modes, the laser medium 
containing the active atoms, and a pump, usually an intense light source, 
to excite the atoms into a broad band of excited states. It is a property 
of the laser medium that the atoms decay from these states in an ex- 
tremely short time by nonradiative processes to one or more very sharp 
excited states, called the upper laser levels. The upper laser levels can 
decay radiatively to a sharp lower level, called the lower laser level, 
which may be the ground state. If the lower laser level is not the ground 
state, we shall assume that very rapid nonradiative decay processes 
return the atom to the ground state. Thus we may always neglect the 
population of the pumping band and of the lower laser level if the latter 
is not the ground state. The laser transition takes place from the lowest 
of the upper laser levels, but it may sometimes be necessary to take into 
account the populations of nearby levels in thermal equilibrium with this 
level. The statistical weights of the laser levels must also be taken into 
account.®° 

The rate equations may always be written in the form (p = photon 
number, n = inversion) 


dp pn 

ap aay a (1) 
dn _ m—” ~~ pn 

dbs oh ge (2) 


as long as we consider only a single mode of the cavity, the laser mode, 
and neglect all atomic populations except the upper laser level and the 
ground state. The time ¢) might be called the pumping relaxation time 
1 1 1 ; 
ras ii i tr ’ (3) 
since it represents the characteristic time in the response of the popula- 
tion inversion n to the pump. The inversion n may always be written 


n = Nu — (1/w)Ni, (4) 


where N,, Ni are the populations of the upper and lower laser levels 
respectively and w is the statistical weight of the lower relative to that 
of the upper laser level. In view of our assumptions, w will come in only 
when the lower laser level is the ground state. Let us suppose that the 
upper laser level is in thermal equilibrium’”” with certain other states 
not directly involved in the laser transition such that there is a tempera- 
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ture-dependent probability P that an excited atom is in the upper laser 
level. Then we have for a three-level system, in which the lower laser level 
is the ground state, 


Pay ae oe 
w 


1 1 
PN u(t — pte); 


and for a four-level system, in which the lower laser level is not the ground 
state, we have 


(5) 





no 


P 
Nn = PN t/t, . 


a 


(6) 


If the relaxation of the upper laser level is predominantly by spontance- 
ous emission to the lower laser level, there is a simple relation between 
t, , tm , and the linewidth Av (eps) of the laser transition 


tm = (810 nret'/c*) V t,Av, (7) 


where V is the volume of the cavity and Nr is the refractive index. This 
relation should not be taken too literally, since it takes no account of the 
anisotropy of the laser medium or the polarization properties of the 
transition, and V would have to be replaced by a suitable effective “op- 
tical volume” if the laser material does not fill the cavity. Nevertheless, 
it points out the important fact that t,, varies with temperature in the 
same way as Ap and therefore is subject to control in spiking studies. 
Another constant subject to convenient control is ¢, , since 1/¢, is pro- 
portional to the pump intensity. Even when flash lamps are used for 
pumping it is still approximately valid to assume ¢, is constant, since the 
time constant for the flash will usually be much longer than the interval 
between spikes. To assure that this is true, it would be advantageous to 
have the spike pattern commence when the flash is at its maximum in- 
tensity. It is not valid to assume without investigation that 1/t, is 
proportional to the total energy dissipated in the flash. Spiking data 
should always include a record of the flash as a function of time and an 
indication of when the spike pattern occurred. Also subject to experi- 
mental control is ¢, , the photon lifetime in the laser mode of the cavity. 
Presumably the losses in a good laser cavity can be estimated rather re- 
liably, so that ¢, can usually be directly calculated. The total number NV 
of active laser atoms can ordinarily be determined in a given sample, but 
N does not appear to be a convenient parameter to vary in laser experi- 
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ments. The signal or source strength s is best determined from an analysis 
of the spiking data as described in this paper. Although s = s(¢) will in 
general be a function of time, it will be shown that the spiking pattern 
depends only on the value of s at the instant when the net losses in the 
laser mode vanish due to the buildup of inversion. We shall denote this 
time by ¢, and the critical inversion by n; , where 


N= taf ts (8) 


is the celebrated Schawlow-Townes’ criterion for the buildup of laser 
oscillation. 

We now introduce dimensionless variables and parameters; in terms 
of the variables 


ee 
7 = Utp/tm) = n/n (9) 
p = p(4 to/tm), 
and the parameters 
w = ty/to 
E = no(tp/tm) = No/m (10) 


Cc = s(a tpto/tm), 


the rate equations (1), (2) become 


p= Pao ptm (11) 
: dn 
n= a = o(E — 1 — pn). (12) 


Here p represents the photons, 7 the znversion, and 7 the tzme, while w 
represents the pumping rate, — the limiting inversion toward which the 
pump is tending to drive the system, and o the source. We see that there 
are really only three parameters in the rate equations; it follows that 
three relationships among the six relevant physical parameters with 
which we started can be obtained from spiking studies. Although we 
shall assume in our analysis that w and é are constant, our results will 
provide a useful adiabatic approximation for the case of slowly varying 
w, & Typical values of the parameters for a ruby laser will be given in 
the discussion of a numerical example. For the present we need only 
mention that o is relevant only in the initial growth of p prior to the on- 
set of laser gain. 
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If we regard o asa constant, the steady-state solution of (11) and (12) 

is 

(E+ o —1) + (E+ 0 — 1)’ + 4o}} 

S(E +o +1) — (E+ 0-1)’ + dof}. 


The sign of the radical is determined by the requirement that p = 0. 
In the limit « — 0 we obtain two cases, depending on whether ~ < 1 or 
&>1.Foré <1 


I 


Poo 


(13) 


I 


Neo 


po —> o/(1 — &) 
oa E <1; (14) 
Neo —> &(1 — pw) 


this is the case in which the limiting inversion mp is less than n, given by 
(8), and laser oscillation does not occur. For & > 1 


pee 
ore) E> 1; (15) 


No —> 1 


this describes the steady state of laser oscillation, which is usually ap- 
proached through a series of sharp pulses in p(7) called relaxation oscilla- 
tions, or spikes. 


II. THE NATURE OF THE SPIKING SOLUTIONS TO THE RATE EQUATIONS 


In this section we shall consider the nature of the solutions to (11) 
and (12) when é > 1 and the initial conditions on p and 7 correspond to 
very few photons and a small inversion n <n, . This may be contrasted 
with the situation in the giant-pulse laser in which immediately after 
switching n >> n,. We shall also assume the spiking condition 


wE <1, (16) 


which will be satisfied whenever spiking can be observed. It will be ap- 
parent later that when (16) is not satisfied there will be no spikes, but p 
will smoothly approach p., . This is the case in gas lasers, where the pump- 
ing rate 1/t, has to be very high to overcome the high relaxation rate 
1/t; ~ 10° sec’. 

In view of (16) and (12), (7) will increase slowly with time and p 
in (11) can be neglected; thus we have 


p(t) & p(r) = o(7)/C1 — 2(7)) (17) 


until 7 approaches 7; and 7 approaches unity. The behavior of p(7r) is 
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shown in Fig. 1 for the initial condition p(0) = 0. Initially p rises with 
slope «(0) and asymptotically approaches the adiabatic solution f(r) 
which it follows for a relatively long time until 7 — 7,. It follows that 
the initial condition on p(7) is unimportant. As 7 passes through 7,, 
where 


a(m) = 1, (18) 


p(7) remains finite and is no longer given by (17); we then leave the 
adiabatic phase and enter the spiking phase of the time development of 
p(7r). According to (11) the slope of p(7) at 7, is o(71) = o1. Thus we 
construct an approximate solution by smoothly joining (17) to a line 
of slope o1 as shown in Fig. 1. It follows that 


p(71) = pr = 201/(w)', (19) 
where 
PS a. (20) 


This is our first important result. It shows that the source occurs in 
spiking theory only as a parameter, the single value o,. Thus we may 
drop the subscript on o and let o; = o, a constant. 

Once we enter the spiking phase, n(7) remains close to unity, fluctuat- 





7 


T_ 


Fig. 1 — The adiabatic phase of the growth of p(r). The curve is the adiabatic 
solution (17), and the line segments of slope o are constructions to approximate 
p(r) near 7 = Oand?s = 71 wherey = m = 1. 
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ing up and down and eventually settling down to its steady-state value 
No = 1. Thus we make the approximations 

n—-12I1n7, (21) 
and 


E16 + et — 2). (22) 


The rate equations (11), (12) can now be written 


inp = Inq (23) 
7 ng = w(8 — Elny — p). (24) 
B 


Since the inversion is not directly observed we eliminate In y from (23), 
(24); the result is most conveniently written 


Y = w8(1 — e*) — wth (25) 
in terms of the logarithmic light output 
W = In (/6). (26) 


The discussion of (25) is greatly facilitated by a mechanical analogy 
which is shown in Tig. 2. We regard W as the coordinate of a particle of 
unit mass moving in a one dimensional potential field. 


Vv 
—op [ (1 — e”) dw 


= wh(e’ — VW — 1). 


y 
(v) (27) 


There is also a dissipative resistive force wéW as if the particle were mov- 
ing through a viscous medium. For the moment let us disregard the 
viscous force, in which case the total energy / of the particle is con- 
served 


E = V(¥) + 3’. (28) 


The particle executes a periodic motion between extreme points V,, < 0 
and Wy > 0 such that 


V(¥n) = V(¥u) = LE. (29) 


In the spiking phase it is permissible to neglect exp (V,,). From (27) 
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Fig. 2 — The spiking phase of the time development of p(r) considered in terms 
of a mechanical analogy in which a particle with coordinate © = In (p/8) moves in 
a potential V(W) given by (27). The extremes of the motion Y,, and Vy, are deter- 
mined by the total energy EF of the particle. 


and (29) we then obtain a simple relation between V,, and Vx 
Vy — Vn, = exp (Wu). (30) 


This is our second important result. 
Let us define 


Wi = In (91/8); (31) 
then it follows from (28) that 
Vn = VW — (01/p1)?/2a8, (32) 


since ¥, = o1/p,. It will be clear from the numerical example in the next 
section that the second term can be neglected, and we can write 


Vn ~ VY . (33) 


This says that the small kinetic energy of the particle at the start of the 
spiking phase can be neglected. 

It is convenient to denote the successive times when n(7r) = 1 by 
™1,72,73,°°* . From (23) we see that these times correspond to ex- 
trema in the motion of VW; according to this convention the successive 
minima and maxima of W are 

minima: (%,),W%3,%,W,--- 


(34) 
maxima: VW, ,W%4,We, °° 


1516 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1964 


We have placed VY, in parentheses, since it is a minimum only in the 
mechanical analogy and not in the observed light output. In the absence 
of damping, of course, we would have % = W = W; = ---, and 
Y= Ww = & = ---. If (16) is satisfied, the damping will be small 
enough so that the motion is approximately periodic, or quasi-periodic. 
Thus to a good approximation we can compute the first maximum WV. 
from VW, by means of (30). Before considering the damping we shall con- 
sider other quantities of experimental interest which are characteristic 
of the periodic motion. 

The maximum velocity Vnax = Vo is capable of being measured experi- 
mentally from spiking patterns in which the spikes are well resolved in 
time. It follows from (28), (29) and (383) that 


Wimax = Wo = [2w8(e _ Wy a L)}* 
2’ [208(—W, — 1)]'. 


Vy 


(35) 
In numerical applications this can be used to ascertain the validity of 
(21), since according to (23) 

Vo = In Nmax © NYmax — 1. (36) 


In general we can write near VY, 


1 


© = (22 — Vw)" 


; (37) 
Pe Ras — Wa) 
thus the time dependence near a minimum is given by 
Wir) SW + $oB(7 — ri) (38) 


Let us denote by m the full width in time of the minimum measured 
between points e times the minimum in light output; this is the same as 
the full width of Y(7) measured between points V,, + 1. Thus the dura- 
tion of the minima according to (38) is 


m = (8/w8)’. (39) 


This applies to all minima regardless of damping, which provides a very 
convenient means for determining almost by inspection whether or not 
a spike pattern is consistent with the assumption of constant w8. Even 
if the minima are not observed to have the same durations, it may be 
meaningful, in the sense that our theory provides an adiabatic approxi- 
mation, to apply (39) to each minimum separately and deduce the varia- 
tion of wf. 
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Near the maximum W we write instead of (37) 


W & [2u8(e"? — 1)(W — v)} 


: (40) 
wd) [2wB (Ve —-WYy- 1) (% —_ V)I’, 
where use has been made of (30); thus 
W(r) WH — 4uB(W — Wy — 1) (7 — 7)”. (41) 


The duration M of a maximum will be defined as the time interval meas- 
ured between points at 1/e times the maximum in light output; this is 
the same as the interval between points at Vy — 1. We see from the 
factor (% — WY, — 1) in (41) that MW will depend upon damping; it 
may be written in a general way 


M = m(Vy — Vn — 1)7%. (42) 


There is in this relation a certain ambiguity which is inherent in our 
method of regarding the motion as quasi-periodic. In applying the rela- 
tion we may wonder whether the minimum is the one preceding or fol- 
lowing the maximum. Within the accuracy of the quasi-periodic approxi- 
mation it makes no difference: either may be used, or the average of the 
two. 

The most readily observed quantity in spiking experiments is the in- 
terval between spikes, which can be identified with the period of the 
quasi-periodic motion 


I= $ dv /y, (43) 


where the integral is over one cycle. To evaluate J we use the approxima- 
tions (37) and (40), which are accurate near the turning points WY, and 
WV, respectively where 1/¥ is large. Upon comparing (35) and (37) we see 
that (37) is reasonably accurate even at Y = 0 providing |%;| > 1. 
However, (40) is only accurate near WY , say in the range 


We a 1 < WV < WW . 
Thus we shall use (37) in the range ¥; S VW S W, and (40) in the range 


Vv. < YW S \%, where ¥, is a crossover point which will be determined 
presently. The integral (43) can now be carried out to obtain 


(WV, _ v,)} 
(% — Wy — 7 





10%.) © m| (te = wo)! + (44) 


Since both our approximations tend to underestimate 1/¥, we must 


1518 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1964 


choose Y, so as to maximize J, which gives the condition 
Y= — 1. (45) 
Thus the interval I between spikes is given by 
IT = m[(Vu — Yn — 1)? + (Ou — Yn — 1) 


M(Vu — Yn). 


It is logical in this case to choose the minimum between the two maxima 
between which J is measured. There is still an ambiguity, however, in 
the choice of maxima. Since our approximation tends to underestimate 
I, it is good to use the larger of the two maxima. 

We now return to the equation of motion (25) and consider the damp- 
ing force —wé. If (16) holds, the damping will be small, and can be 
computed from the work done per cycle against the damping force. 


W = wt § Wd. (47) 


With damping present, energy is no longer conserved, but decreases by 
W every cycle of the motion until the particle eventually settles down at 
its equilibrium position Y = 0, corresponding to the steady-state light 
output given by (15). Near VY, we can neglect e”, so that (27) gives 


We evaluate (47) just as we did (48), using (37) and (40) with a cross- 
over point y, , determined this time by the condition that W should be 
a minimum; the result is 


W = (49/2/8) (w€) (w8)*[(War — Wn — 1)8 + (Yar — Vn — 1)*]. (49) 


Let us denote damping by the increment AY,, between successive minima, 
or AV, between successive maxima. From (48) and (49) the general 
formula for damping of minima is 


AVn = (49/2/3) wt (cB) *[(Yar — Yn — 1)8+ (Yar — Yn — 1)4]. (50) 


The choice of V,, is ambiguous, but Vy refers to the maximum between 
the two minima of AY,, . It is obvious from the shape of the potential 
V(¥) shown in Fig. 2 that the maxima will be less damped than the 
minima. For small damping we have 


wB(e*? — 1)\(% — Mu) = W, (51) 


where W is now computed by integrating (47) from WY around the cycle 
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and back to % . The result is 
AV y = (49/2/38) wt(o8) “(Ya — Yn — 1)? + (Yar — Yn — 174] (52) 
with the familiar ambiguity in choice of Va,. From (52), (46) and (39) 
we obtain the very convenient formula 

AV yy = 3(wé) I (53) 


relating the damping directly to the interval. We note that all ambiguity 
has disappeared from (53). 

We now have a complete arsenal of formulas with which to attack ex- 
perimental spiking patterns. Our formulas give all of the minima and 
maxima of WV as well as the durations and intervals and the maximum of 
in terms of the dimensionless rate equation parameters w, & o and 
8 = & — 1. These formulas are valid in the spiking phase where 


Vy — Vy = 1 > 1. (54) 
As the spikes damp out the solution finally enters the small-signal phase 
WV Moo Win —_= 0. (55) 


5,6,13,22,29,31 


The small-signal solution is well known, so there is no need 


to discuss it here, but we give it for ready reference: 
n(r) = 1 + Ae?***[Q cos (Q7 + v) — 4wé sin (Q7 + ¢)] 
p(t) = Bl + Ac? sin (Qr + ¢)] (56) 
Q = w8 — }(wk)” 


ll 


where A is an arbitrary real small amplitude and ¢ is an arbitrary real 
phase. Exactly the same small-signal solution is obtained from (25), 
thereby justifying the approximation (22). It is now easy to see that 
when (16) breaks down the frequency © of the small-signal solution be- 
comes imaginary and there are no oscillations. This may be the case in 
the gas laser, where we may have &~ 1, 8 <1, and w > 46. 


IV. A NUMERICAL EXAMPLE 


Before attempting to apply our formulas to the analysis of spiking 
patterns we wish to discuss their accuracy with the aid of a machine 
calculation. For numerical integration the rate equations (11), (12) are 
best written in the form 

a boas. a; irony y 
£ = w(& — 2x — ze”) 57) 
y=oe* +2x-1, 
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where 
L=y7 
(58) 
y = Inp. 


As already pointed out following (17), the initial condition on x,y is not 
critical; it is convenient to start the solution on the adiabatic solution 
(17), so we take 


y(0) = Ina, x(0) = 0. (59) 
The constants will have the values 
w= 7.12 X 10° 
g=5 (60) 


2x 10° 


om 


which are typical for a ruby laser. The numerical integration of (57) 
has been performed on the IBM 7090 computer using Hamming’s” 
predictor-corrector method. The program provides for automatic halving 
and doubling of the integration interval under the control of the frac- 
tional error of the increment and a preselected tolerance (5 X 107°). 
The stability and accuracy of the program for this problem were checked 
using initial conditions which lead to the small-signal solution (56). 

The results are shown in Fig. 3. The solid curve and scale on the right 
give y(7), while the dotted curve and scale on the left give x(7) for + 
in the range 3 X 10° to 4 X 10*. The origin for 7 is completely arbitrary. 
From (26), (58) and (60) we have 


y=V+np = + 1.387. (61) 


We are concerned primarily with y, since the inversion is not ordinarily 
observed experimentally. The main features of the computed results are 
summarized in Table I, which lists the values of 7, , yn, and m, or M,, 
forn = 1, -:- , 9 for the first nine extrema in the notation of (34). These 
values were obtained from the computed points by fitting a parabola to 
the three points nearest the extremum. The spacing of the computed 
points was sufficiently small (Ar = 0.005 X 10°) that in all cases all 
three points lay well within the validity of the parabolic approximations 
(38) or (41). 

We shall now attempt to calculate the information of Table I by the 
formulas of the preceding section. In every case we shall indicate the 
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Fig. 3 — A machine calculation of y(r), solid curve with scale on the right, and 
x(r), dotted curve with scale on the left, satisfying the rate equations (57) for 
wo =7.12X105&=5,¢6=2X 10°. The important results are summarized in 
Table I. Here x(r) represents the inversion and y(r) the logarithmic light output. 


TasLE I—Computep Resuits oF NUMERICAL EXAMPLE 
Summary of results of machine solution to the rate equations for w = 7.12 X 
10-6, —€ = 5, o = 2 X 10~. Also obtained were tmx — 1 = 0.02830, Ymax = 
0.0282. 





n ™ Yn Mn Mn 

1 3.13844 X 104 —14.570 — 

2 3.2521 4.240 0.01416 * 10! 
3 3.3642 —12.815 0.0532 

4 3.4806 4.179 0.01415 

5 3.5906 —11.795 0.0529 

6 3.7013 4.053 0.0151 

7 3.8100 —10.986 0.0531 

8 3.9186 4.064 0.0152 

9 4.0239 —10.303 0.0531 | 


ED 
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correct value as computed by machine on the right in parentheses. Ac- 
cording to (19) 
pr = 7.50 X 10° 
(62) 
yi = In pi = —14.09 (—14.57). 


Thus we overestimate y; by 0.48 or 3.3 per cent. From (61) the theoretical 
value for ¥; is 


WY, = —15.48. (63) 
Putting V, = VW, and Vy = WV in (30) gives 
VW = 2.91, (64) 
or 
yo = 4.30 (4.240). (65) 


Thus we overestimate yo by 0.06 or 1.4 per cent. For the excursion of y 
we obtain 


2 —- I = We —V, = 18.39 (18.81), (66) 


which is an underestimate by 0.42 or 2.2 per cent. Henceforth we need 
not consider VY, but only y. From (50) using Va — Vn, = yo — yr we 
obtain 


Ys — Yr = 0.965 (1.355) (67) 
yg = —13.12  (—12.815). (68) 


We are underestimating y3; by 0.30 or 2.2 per cent. From (52) using 
Va — Yn = Yo — Y3 we obtain 


Yo — ys = 0.0541 (0.061) (69) 
ys = 4.25 (4.179). (70) 


I 


By repeating steps (67) and (69) we could obtain values for any number 
of succeeding maxima and minima 


Ys — Ys = 0.881 (1.020) (71) 
Ys = —12.24 (11.80) (72) 
ys — Ye = 0.0523 (0.126) (73) 
Ys = 4.19 (4.053). (74) 


We observe from Table I that ys — yes is unusually large (0.126), 
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while yg — ys is negative (—0.011); the average of these is (ys — ys) = 
(0.058) in better agreement with the machine-computed y. — ys and 
with theory. Thus we suspect that the discrepancy in (738) is not sig- 
nificant. There is apparently a slight inaccuracy in the computed solu- 
tion on the third and fourth spike. One might expect that noise of 
various kinds would have a similar effect on real lasers, making the damp- 
ing from spike to spike unreliable. Averaging over several spikes, how- 
ever, would still give a meaningful value for Ay , as it does in our com- 
puted spike pattern. 
The durations of all the minima should be given by (39) 


m = 0.0526 X 10' —- (0.0531 X 10°). (75) 


We see in Table I that m is constant within 0.4 per cent and the average 
over the four minima is 0.0531 X 10°. The durations of the maxima will 
be calculated from (42), using for Vx, — W,, the values of the excursions 
just calculated, with the minimum preceding the maximum 


M, = 0.0127 X 10° ~— (0.01416 X 10°) 
M, = 0.0130 (0.01415) (76) 
M, = 0.0134 (0.0152). 


Here we have underestimated the spike durations by 11-12 per cent. 


The intervals are given by (46) 
Te4 = 0.226 X 10° ~— (0.2285 X 10°) on 
Ti6 = 0.220 X 10° — (0.2207 X 10°). 


The agreement here may be considered perfect. According to (53) we 
should have 


Ayu/I = 2.37 X 10° (78) 
for all consecutive spikes; the computed results are as follows: 
nel (2.66 X 107°) 
4 —> 6 (5.70 X 107°) 
a (79) 
6-8 (0.51 X 10”) 


1[(4 > 6) + (6—8)] (2.63 X 107°). 


The last line is considered more significant than the discrepancies in 
the third and fourth lines. Thus our theory underestimates Ayy,/J by 
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11 per cent. Finally, from (35) 
Ymax = Cmax == ] = 0.0288 


(80) 
(0.0282) (0.0283), 


which shows that the approximation (21) is excellent. 

We may conclude from this confrontation between our formulas and 
a machine computation that the accuracy is probably adequate for the 
analysis of experimental spike patterns. 


V. APPLICATION OF THE FORMULAS 


Experimental data on spiking are ordinarily obtained in the form of an 
oscilloscope trace proportional to the light output. The trace is propor- 
tional to p(t), the number of photons in the laser mode of the cavity, 
but it is usually considered impractical to calibrate the equipment so 
as to obtain the absolute value of p(t). And even if p(t) could be meas- 
ured absolutely it would not fix the absolute value of (7) defined in 
(9). We shall assume that only relative values of p(7) can be measured. 
The most significant measurements of p are the peak-to-valley ratios 


R = pu/pm, (81) 


the ratio of the maximum light output of a spike to the minimum output 
in a neighboring minimum. In analyzing a spike pattern with many 
spikes, some convention must be adopted on which minimum to choose. 
Irom (26) and (61) 


Thus the excursions of the model particle in the potential V(W) are 
uniquely fixed by the data. It follows from (80) that both Vy and V,, 
are fixed by R 


Vy = In (In Rk) 


(83) 
Vn = In (Ink) —InR. 


Even W, , the hypothetical minimum not actually observed, can be de- 
termined by extrapolating the other minima Y,, . Thus all of the extrema 
may be regarded as immediately fixed by the data. From the extrema 
alone we obtain 2 relations satisfied by the three parameters w,é,c, 
namely (19) and (50) or (52). 

It is now clear that a single spike pattern does not contain enough 
information to determine w,£,o. The reason for this is that we cannot 
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make use of the measured durations or intervals, since we do not know 
the ratio t, between real time ¢ and dimensionless time 7. We might 
think that we could obtain information from time ratios such as (17/m) 
or (m/I). It turns out that we can, but it is a different kind of informa- 
tion, the kind we have called ‘qualitative’ in Section I. Consider the 
three quantities 


A = (M/I) MR (84) 
B = (m/I)[(n R — 1)? + (InR — 1)7 (85) 
C = (AV,,/AY,,)(In R — 1), (86) 


all of which can be determined immediately from the data since they 
depend only on ratios. It is to be expected that C might have to be 
averaged over several spikes to get a meaningful value. Except for this 
difficulty, values of A, B, and C may be calculated for every spike in 
the data after adopting some convention to handle the usual ambiguity 
in the definitions (84), (85), (86). One such convention is illustrated 
by the following example: 


In Rp = WH — Vs 
A, = Lo/ 49-4 2 
(Mo/Io.4) In R : (87) 
By = (m3/TIo-4) [Un Re — 1)? + Un Re — 1)7] 
C, => [(V = W,)/(¥ = W,)](In Re = 1). 


Irom (42), (46), (50) and (52) we find the simple relations 
5 a ote ae aa | (88) 


which do not involve the parameters w,£,c. Therefore (88) should hold 
even if w,é are slowly-varying functions of time within our adiabatic 
approximation. We call the relations (88) consistency relations, since 
they can be used to determine whether data are consistent with the 
rate equations. If (88) is satisfied reasonably well, at least in an average 
sense over the spikes, it is a foregone conclusion that a reasonable fit to 
the data can be obtained from the rate equations. The converse is also 
true: if (88) is not satisfied the data cannot be fitted from the rate 
equations. 

The importance of observing the minima (valleys) as well as the 
maxima (peaks) in a spike pattern is abundantly clear. Without the 
valleys we cannot determine R, m, or AY,, , all of which appear in the 
consistency relations. Therefore a great deal of information is lost unless 
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the valleys can be seen above noise. This presents some difficulty, 
because ordinarily R > 1 is much too large to be measured from an 
oscilloscope trace that responds linearly to the light output and contains 
both the peaks and the valleys. In spike patterns the valleys usually 
just correspond to the noise level in the experiment. It is not our purpose 
to go into experimental details except to point out that with care the 
valleys should be observable. The basic experimental requirement is 
that the acceptance cone of the detector should correspond to the radia- 
tion cone of the laser so as to exclude extraneous light from the pump 
and the spontaneous emission of the laser medium. If it is not possible 
to measure the valleys, there is still one consistency relation that can 
be applied to the peaks alone. From (46), (82) and (83) we have 


Ink = 1I/M 


(89) 
—¥,, = (I/M) — In (I/M). 


The use of (46) is equivalent to assuming A = 1. Thus the minima can 
be calculated from I/M if we assume A = 1. We could obtain the 
durations m of the minima by putting B = 1. The damping of the 
minima would not be given very reliably by (89), but can be obtained 
from (86) by putting C = 1. Thus we can deduce R, m, and AV,, from 
the peaks alone, but we lose all of our consistency relations (88). How- 
ever, from (89), (42), (89) and (35) it follows that 


D =1, (90) 
where 
D = (MWax/4)[(I/M) — 1]}*/[(Z/M) — In (1/M) — 1° (91) 


can be determined from the peaks if the time resolution is good enough 
to give a good value for (MWmax). If (90) is satisfied, it is probably good 
evidence that the laser obeys the rate equations, and a rate equation 
analysis is meaningful. However, if all that is desired is to apply the 
rate equations blindly to obtain quantitative information, it is not neces- 
sary to measure MW... All of the quantitative information in a con- 
sistent spike pattern can be deduced from J, M, and AV, . 

We now consider practical ways of obtaining quantitative information 
from spike patterns. The one-patiern method is to measure tp by an inde- 
pendent experiment. The measurement of the cavity losses has been 
discussed by several authors.” Suffice it to say here that ¢, can be 
measured from the dependence of the threshold flash energy for produc- 
ing laser action on the temperature and on losses deliberately introduced 
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into the laser mode. Once ¢, is measured all of the times m, M and I 
become known in dimensionless time. From (389) 


wB = wo(& — 1) = 8/m’, (92) 
and from (53) 
wk = 3AW,,/T. (93) 
These equations can be solved for w,é 
E = ym/(ym — 1) 
(94) 
w = 8(ym — 1)/m’, 
where 
y = te(m/I) AV (95) 
is independent of t, . Now o can be obtained from (19) and (89) 
og = 48(w8)*(I/M)e (96) 


with M = M 2g and J = Iy.4. This procedure makes use of the duration 
m of the valleys but not the peak-to-valley ratio R. If only the peaks 
are observed we write instead of (92) 


w8 = w(& — 1) = (8/MTI)/{1 — (M7/F)). (97) 
Solving (93) and (97) for w,é gives 


—€= 6M/(6M — 1) 
(98) 
w = 8(6M —1)/(1 — M)M, 
where 
6 = [1 — (W/D1 AV (99) 


is independent of t, . This kind of analysis can be applied to every spike 
in a spike pattern. It may be found that £ and especially w vary slowly 
from spike to spike, which is permissible within our adiabatic approxima- 
tion. If the data satisfy the consistency relations (88), the same values 
of w,é will be obtained from (94) and (98). The greatest weakness of 
this method is that (94) fails completely if ym S 1; (98) failsif 647 S 1. 
Thus a great deal depends on the accuracy of the formulas as well as 
that of the measurements of t, and y or 6. It follows that the method will 
fail whenever é > 1. 

As an example we shall apply this method to the machine calculation 
considered in Section IV. We consider the solid curve of Fig. 3 to be the 
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data, which implies that ¢, is known and both the peaks and valleys 
have been studied. The relevant numbers have already been given in 
parentheses in (69), (75), (76) and (77); we repeat them here without 
parentheses 


AVy = Yo -— Ws = 0.061 
0.0526 x 10° 


on (100) 
M = M, = 0.0142 X 10 
I = In4 = 0.2285 X 10°. 
From (94), (95) and (96) we obtain 
£ = 34 (5.0) 
w=12X10° (7.12 X 107°) (101) 
¢=10X10° (2X10). 


The lack of accuracy is primarily due to the fact that £ = 5 is a little 
too large to give good results with this method. Using only the peaks, 
we obtain from (96), (98) and (99) 


g = 2.9 (5.0) 
14X10°° (7.12 X 107°) (102) 
8 xX 10° (2X 107°). 


Ww 


Il 


o 


We now describe a two-pattern method which does not require the 
measurement of ¢, . In fact, it yields a value for ¢, and may in some cases 
give better results for w,€ than the one just described. It is based upon 
observing the valleys in two or more spike patterns for which the 
relative values of w and é are known. We must assume that the linewidth 
Av is known as a function of temperature. Let us suppose that we meas- 
ure the valley durations m and m’ in two spike patterns in which the 
ratios (£’/£) and (w’/w) are known. We know (é’/) from the tempera- 
tures at which the patterns were obtained. We can obtain w’/w from the 
relative pump intensities at the times when spiking occurred. It im- 
mediately follows from (92) that 


a (wm?/w'm'’) — 1 
(wm? /w'm'?) — (&/E) ” 


This result is meaningful providing that the data give (wm’/w’m”) lying 
between (é’/&) and unity. Once £ is determined w can be calculated from 


g (103) 
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(92) and (93) 





= 9 (é — 1) (m . 2 
o= 39 2 (7) (AV ar) ? (104) 


and o is again obtained from (96). 

As an example of this method we shall apply it to two machine-calcu- 
lated spike patterns, one of which is that of Fig. 3 with the parameters 
(60), and the other has the parameters 


/ 


Oo = WwW 
a4 (105) 
oa’ = ¢d. 


From the machine-caleculated pattern we find 
m' = m;' = 0.0614 X 10%. (106) 
The value of m = mz; is given in (75). We assume that the ratio 
’/& = 0.8 (107) 


is known from the temperatures at which the data were taken, and the 
ratio 


(wm? /w'm”) = 0.748 (108) 


is calculable from (w/w’) and the two valleys. We now obtain from (103) 
the value 


& = 4.85 (5.0). (109) 
From (104) we now obtain 
wo=90X10° (7.12 X 10°) (110) 


using the values in parentheses from (75), (77) and (69). From (39), 
(109) and (110) we obtain the absolute value of m 


m = 0.048 X 10° ~—- (0.0531 X 10°). (111) 
It follows that ¢, is given by 
tp = Mexp/M, (112) 


where mexp is the measured duration in laboratory time. We conclude 
that the two-pattern method is to be preferred to the one-pattern method 
as a general approach to spike analysis. It should be mentioned that 
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there seems to be no two-pattern method involving only the peaks which 
is sufficiently accurate to give meaningful results. 

The most convenient experiments to perform involve changing the 
pump intensity 1/t, while all other parameters remain fixed. If t, < t,, 
as is usually the case in flash-pumped lasers, ~ will be independent of ¢, 
and w will vary as 1/t, . Varying w does not give quantitative informa- 
tion such as we obtained from varying é in the two-pattern method. 
Nevertheless, it is of interest to consider what effects are to be expected. 
From (30), (26) and (19) we have roughly 


pu ~ —BYm 
~ Bln (BV w6/2c). 


Thus the maxima in p depend only logarithmically on pumping power. 
It follows from (9) that the observed light output proportional to p 
should vary as 


(114) 


je teal 0 ae (115) 


The time intervals m, (7, and I should vary as 


ma« M« I « ty. (116) 


VI. SUMMARY 


We have now outlined a comprehensive program for the study of lasers 
by means of their spiking patterns. The rate equations have been formu- 
lated in terms of the light output and the atomic inversion and five 
physical parameters in (1) and (2). We have written the equations in a 
general form valid for three- and four-level systems and taking into ac- 
count statistical weights and thermalization of the upper laser level. 
These equations were then reduced to dimensionless form in terms of 
three parameters in (11) and (12). All of the properties of the spiking 
solutions of experimental interest were then deduced analytically by 
means of the mechanical analogy shown in Fig. 2, in which a particle 
moves in a potential well in a viscous medium. 

The formulas obtained were illustrated and tested for accuracy against 
a machine-computed solution to the rate equations. The value of the 
formulas was confirmed by this comparison, and we went on to discuss 
their application to experimentally observed spiking patterns. Four rela- 
tions were given whereby spike patterns can be tested for consistency 
with the rate equations. These consistency relations do not contain any 
parameters, only ratios of times and of light outputs. 
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Two methods were described for obtaining quantitative information. 
In the one-pattern method everything is deduced from a single spike 
pattern, but it is necessary to measure the photon lifetime t, in the cavity 
by independent experiments. It is possible to apply this method to data 
in which only the peaks are observed. It is emphasized, however, that 
the observation of the valleys in light output should be perfectly feasible 
and very much worthwhile. In the two-pattern method all the parameters 
and ¢, are deduced from two patterns obtained at different temperatures. 
In this method, the more accurate of the two, it is essential that the val- 
leys be observed. 

Our objective has been to provide the researcher with a set of tools for 
applying the rate equations to experimental spiking data. The spiking 
phenomenon in solid state lasers can be utilized in research now that it is 
beginning to come under good experimental control. We have tried here 
to point out what kind of spiking data is needed, and what additional 
information is needed, to get the maximum information from spiking. 
Our analysis applies to the sharp spike region of time in which the rate 
equations are highly nonlinear. It complements the well known small- 
signal analysis in which the rate equations become linear. The entire dis- 
cussion is based upon the rate equations of Statz and De Mars, which we 
regard as reasonable, relevant and widely accepted. We have not gone 
into the derivation of these equations or the modifications that have been 
proposed or might be proposed. We prefer to leave that to the future, 
when we may reasonably expect that systematic spiking studies will 
have clearly revealed the adequacy or inadequacy of these equations, 
and indicated the direction which these modifications must take. 
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Design of Wideband Sampled-Data Filters 


By R. M. GOLDEN and J. I’. KAISER 
(Manuscript received March 6, 1964) 


A design procedure is presented for readily obtaining sampled-data filter 
representations of continuous filters. The procedure utilizes the bilinear z 
transformation and preserves the essential amplitude characteristics of the 
continuous filter over the frequency range between zero and one-half the 
sampling frequency. It is shown that the procedure can yield meaningful 
sampled-data filter designs for many of those filters where the standard z 
transform cannot be used directly. 


I, INTRODUCTION 


Sampled-data filter representations for continuous filters can be ob- 
tained using several different design procedures.! A particular design 
method utilizing the bilinear transformation is developed herein. The 
method is especially useful in designing wideband* sampled-data filters 
which exhibit relatively flat frequency-magnitude characteristics in suc- 
cessive pass and stop bands. Vilters of this type are widely used in net- 
work simulation and data processing problems.” The design method pos- 
sesses two chief advantages over the standard z transform.’ The first is 
that the transformation used is purely algebraic in form. This means it 
can be applied easily to a continuous filter having a rational transfer char- 
acteristic expressed in either polynomial or factored form. The second 
advantage is the elimination of aliasing’ errors inherent in the standard 
z transform. Thus, the sampled-data filter obtained by this design 
method exhibits the same frequency response characteristics as the con- 
tinuous filter except for a nonlinear warping of the frequency scale. 
Compensation for this warping can be made by a suitable frequency 
scale modification. Some of the more common filter networks to which 
the design method can be applied effectively are the Butterworth, Bessel, 
Chebyshev, and elliptic filter structures. 

The essential properties of the bilinear transformation are presented 


* A sampled-data filter design will be termed ‘‘wideband”’ if the frequency range 
of useful approximation approaches half the sampling frequency. 
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in the next section. For comparison purposes, properties of the standard 
z transform are also given. This is followed by a detailed description 
of a filter design procedure using frequency transformations. Examples 
illustrating the design procedure are then presented. A short discussion 
concerning computer simulation of the obtained sampled-data filters is 
also included. 


II. THE STANDARD AND BILINEAR Z TRANSFORMATIONS 


In this section it is assumed that a satisfactory rational expression is 
known for the transfer function of a continuous filter for which a sampled- 
data approximation is sought. What is then necessary is a transformation 
which will convert this transfer characteristic into a sampled-data trans- 
fer function rational in 2~, the unit delay operator. 

Two transformations applicable to this problem are the standard z 
transform and the bilinear z transformation. 

The standard z transform applied to H(s), the transfer function of the 
filter, is® 


i] 


H*(s) = 2) H(s + jmay) (1) 
or equivalently in terms of the impulse response, h(t), of the filter 
ge*(2) = T DL A(IT) 2" (2) 
i=0 
where 
s=ot+ Jw 
H(s) = Laplace transform of h(t) 
w, = 2r/T = radian sampling frequency 
H*(s) = Laplace transform of the sampled filter impulse re- 


sponse 
2 exp (—sT7') = the unit delay operator 
5¢*(z) = H*(s) |,=cuna/r = 2 transform of h(é). 


=1 


The behavior of H(s) for s greater than some critical frequency jw, is 
assumed to be of the form 


H(s) lan s> jo, K/s", n> 0 (3) 


where K is a determined constant. 
Equation (1) or (2) is the transfer function of a sampled-data filter 
which approximates the continuous filter. In the time domain, the im- 
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pulse response of the sampled-data filter is the sampled impulse response 
of the continuous filter. This can be shown by taking the inverse trans- 
form of (2). Equation (1) shows that in the baseband 


(~w,/2 S w S w,/2) 


the frequency response characteristics of the sampled-data filter, H*(s), 
differ from those of the continuous filter, H(s). The difference is the 
amount added or “aliased’” in through terms of the form 


H(s + jmw,),m # 0. 


If H(s) is bandlimited to the baseband, i.e., | H(s) | = 0 forw > w,/2, 
then there is no aliasing error and the sampled-data filter frequency 
response is identical to that of the continuous filter. Unfortunately, 
when f(s) is a rational function of s, it is not bandlimited and therefore 
H(s) # H*(s) in the baseband. 

The magnitude of the errors resulting from aliasing is directly related 
to the high-frequency asymptotic behavior of H(s) as defined in (3). If 
n is large and w, « w,/2, then the aliasing errors will be small and the 
standard z transform generally will yield a satisfactory sampled-data 
filter design. However, in wideband designs, w, is usually an appreciable 
fraction of w,/2. Furthermore, many continuous filter designs result in 
transfer functions in which n is no greater than 1. These tio conditions, 
namely w, & w,/2 and n = 1, can create large aliasing errors in the 
frequency response characteristics, thus yielding an unusable result. 

Fortunately, even when w, & w,/2 and n = 1, a design method em- 
ploying the bilinear z transformation” may provide satisfactory wide- 
band designs. This z form is defined from the mapping transformation, 


s = (2/T) tanh (s,7/2) (4) 
where 
8; = dj + Jor . 


The right-hand side of (4) is periodic in w, with period 27/T. Consider- 
ing only the principal values of w., —7/T < ow, < 7/T, it is seen that 
the transformation given by (4) maps the entire complex s plane into 
the strip in the s; plane bounded by the lines wo, = —a/T and wo, = 
+2/T. For this reason the bilinear transformation can be looked upon 
as a bandlimiting transformation. Therefore, when this transformation 
is applied to a transfer function H(s), the entire s-plane frequency 
characteristics of H(s) are uniquely carried over into the frequency 
characteristics of H(s;). 


* This transformation will be referred to as the bilinear z form or 2 form. 
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With the substitution 
gz bes et a 
(4) can be written immediately as 


2 (1 — 2") 





s = i a+r) . (5) 
Thus, 
SC(z) = H(s)],_ 2 av (6) 
T (1427-1) 


where 3C(z) denotes a sampled-data transfer function obtained by the 
use of the bilinear z form.” 

The transfer function 3C(z) obtained by means of the bilinear z form 
and the function 3C*(z) obtained by means of the standard z transform 
are both rational in 2~! and of the same denominator order as the con- 
tinuous filter. These two functions, 3C(z) and 3C*(z), become essentially 
equal to each other as the sampling frequency becomes large compared to 
the moduli of each of the poles of the continuous filter function, H(s). 
When the sampling frequency is not large, representation of some filters 
by the standard z transform can be quite unsatisfactory because of 
aliasing errors. However, the bilinear z form, with its absence of aliasing 
errors, may give a satisfactory representation for these filters. A particu- 
lar set of filters to which the bilinear z form always can be applied suc- 
cessfully are those which exhibit relatively flat frequency-magnitude 
characteristics in successive pass and stop bands. This follows directly 
from (6). 

Thus, sampled-data filters designed by using the bilinear z form pre- 
serve the essential amplitude characteristics of the continuous filter. In 
the baseband (—w,/2 S w S w,/2), the frequency characteristics of the 
sampled-data filter are identical to those of the continuous filter except 
for a nonlinear warping of the frequency scale. 

This warping is found from (4) upon substituting jw for s and is 


w = (2/T) tan (7/2). (7) 
For small values of w,, the relation is essentially linear, producing 


* It should be noted that the bilinear transform is used here in a distinctly dif- 
ferent way than it is commonly used in sampled-data control system design. In 
the control system literature it is used to transform the sampled-data function 
5C*(z) from the discrete domain back to the continuous domain for conventional 
stability and frequency response analysis. See for example J. T. Tou, Digital and 
Sampled-Data Control Systems, McGraw-Hill, New York, 1959, pp. 244-247 and 
pp. 466-470. 
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Fig. 1— The frequency scale warping of the bilinear z transformation. 


negligible warping at the lower end of the frequency scale. Fig. 1 shows 
the nature of this warping. Compensation for the effect of warping can 
be made by prewarping the band-edge frequencies of the continuous 
filter in such a way that application of the z-form transformation will 
shift the band-edge frequencies back to the desired values. The incor- 
poration of this prewarping compensation into a sampled-data filter 
design procedure is discussed in the next section. 


Wt. A SAMPLED-DATA FILTER DESIGN METHOD 


A sampled-data filter design may be obtained by applying the z-form 
transformation of (6) to the rational transfer characteristic for a continu- 
ous filter. However, in order to compensate for the frequency warping 
imposed by the z form, the frequency characteristics of the continuous 
filter first must be altered or prewarped. Hence the transfer characteristic 
for the continuous filter must be redesigned such that the band-edge 
(cutoff) and maximum loss frequencies are computed according to, 


w, = (2/T) tan (waT'/2) (8) 
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where: 


w, = computed cutoff or loss frequency 
wa = desired cutoff or loss frequency. 


The redesign of the continuous transfer characteristic cannot be ac- 
complished simply by applying (8) to each pole and zero of the original 
transfer characteristic. On the contrary a completely new transfer char- 
acteristic must be obtained for the continuous filter. It is then possible 
to obtain the desired sampled-data filter by applying the z-form of (6) 
directly to the redesigned transfer characteristic of the continuous filter. 
The sampled-data filter so obtained will then have the desired magnitude- 
frequency characteristics. 

Compensation for frequency warping becomes especially simple to 
apply if the original continuous filter design was obtained by applying 
a frequency-band transformation to a suitable low-pass design such as 
Butterworth, Chebyshev, etc. Thus the extensive literature available on 
tabulated low-pass filter designs can be used to great advantage to sim- 
plify the filter design problem. The well-known frequency transforma- 
tions which convert a normalized low-pass filter to a low-pass, a band- 
pass, a bandstop, or a high-pass design are 


Sn = 8/w, low-pass to low-pass (9) 


_ (8° + wor) 


$= low-pass to bandpass (10) 
8(@y — w1) 


= S(w, = 1) 


(s? + Wy1) 
Sn = o/s low-pass to high-pass (12) 


low-pass to bandstop (11) 





Sn 


where: 


S, = the complex variable of the normalized low-pass filter transfer 
function 
s = the complex variable of the desired filter transfer function 
w®, = the upper radian cutoff frequency 
w, = the lower radian cutoff frequency. 


When continuous filters are designed with the aid of these transforma- 
tions, prewarping is accomplished by properly choosing the cutoff fre- 
quencies used in the frequency transformations. The choice of these cut- 
off frequencies is determined from the desired cutoff frequencies by 
means of (8). Using these values, the new prewarped transfer function is 
determined by applying (9), (10), (11) or (12) to the original low-pass 
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function. Transformation is made to a sampled-data filter by applying 
(5) to the prewarped continuous function. This sampled-data filter will 
now have the correct cutoff frequencies. The transfer function thus ob- 
tained can be used directly in a digital computer simulation. 


IV. SIMULATION OF SAMPLED-DATA FILTERS 


Application of either the standard z transform or the bilinear z form 
to a rational transfer function yields a transfer function rational in z~ 
for the sampled-data filter. The programming or simulation of this 
sampled-data filter on a digital computer can be accomplished by either 
the direct, the cascade or the parallel form. These forms, as commonly 
defined, are shown in Fig. 2. In this figure G(z) and §(z) represent finite 
polynomials in zg for feed-forward and feedback transmissions re- 


amma aoe 
Lise] 





g(z) 
(a) DIRECT FORM H(z) = 1+ §(2) 
-/ 7) ---4_) 
G(2) 
(b) CASCADE FORM ee Tl [+02 | 





Siz) 





(C) PARALLEL FORM JZ) = G,(2) D5 +z) 
11 


Fig. 2— Some possible simulation forms for sampled-data filters. 


1540 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1964 


spectively; where subscripted, the order of the polynomial is at most 
second. 

The choice of which of the three forms to use for simulation of the 
sampled-data filter depends on the complexity of the filter function 
H(s), on the form of H(s), and on the particular z transformation used. 
Generally, simulation by the direct form requires considerably greater 
accuracy in the determination of the filter parameters than either of the 
other two forms. This is especially true when the order of H(s) is large 
and when H(s) has poles with real parts that are a very small fraction 
of the sampling frequency. For this reason either the cascade or parallel 
form may be preferred. 

The choice between using the cascade or the parallel form depends 
largely on which z-transform method is used to obtain the sampled filter 
and how that particular method is applied. Realization in the cascade 
form requires calculation of the numerator polynomial, G(z), or its fac- 
tors. This computation consists of a simple algebraic substitution when 
the bilinear z form is applied to a filter function H(s) expressed in the 
form, 

I] (s -— a) ° 
k=1 
Determination of G(z) in polynomial or product form respectively allows 
either of the following cascade realizations to be made: 








. 1 
5C(z) =e G(z) I] F + bie. +. =) (14) 
or 
a “ {aor + ane + a) 
ae I ( 1 + bye + doz? J - (15) 


If the numerator G(s) is in polynomial form, considerable care must be 
taken in the calculation of the coefficients of the polynomial G(z), as this 
computation involves differencing nearly equal numbers. 

For realization in the parallel form the partial fraction expansion of 
H(s) must be known. Since in the standard z transform method obtain- 
ing the partial fraction expansion is a necessary step, simulation of 
filters designed by this method is most directly accomplished in the 
parallel form. Here the continuous filter transfer characteristic is repre- 
sented by 
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N 
Pus + Po: 
H(s) = ‘ 
( ) 2 Qos? + Ques + Qox 
Transforming this expression by use of the standard z transformation 
yields 


(16) 





N an? 
5C(z) = Dy Anz + Aox 


& Buz? + Buz +1 (17) 


whereas transforming by use of the bilinear z form yields the similar 
expression 


N =I 
se(e) = (1+ 27) 2 Ae Ae 


1 Boe + Bye +10 18) 


Each rational function in either of the above summations can be syn- 
thesized by the recursive structure shown in functional block diagram 
form in Fig. 3(a). This recursive structure uses only two delays, four 
multiplications, and five additions. The complete realization of (18) is 
shown in Fig. 3(b). 









Aoj 
— AMPLIFIER 
~ GAIN = Agj 
zits UNIT DELAY 


Ay.2-1 + Aq: a 

2"! + Ao; 3C(z) = (142-1) LD 3, (2) 
«=! 
J 


+(Z = OOO 
ae) Bo.z-2 + By, z-'+1 
Jj J 
(8) (b) 
TERM IN PARTIAL COMPLETE TRANSFER FUNCTION 
FRACTION EXPANSION EXPRESSED AS A PARTIAL 


FRACTION EXPANSION 


_ Fig. 3 — Simulation in parallel form of a sampled-data filter obtained by the 
bilinear z transformation. 
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The programming of these sampled-data filters for computer simula- 
tion can be greatly simplified if a compiler such as the Block Diagram 
(BLODI) compiler’ developed at Bell Telephone Laboratories is used. 
The compiler permits specification of a sampled-data system in func- 
tional block diagram form. 

In the following section an example is presented for a filter designed, 
synthesized, and simulated by the foregoing method. 


V. DESIGN, SYNTHESIS, AND SIMULATION OF A WIDEBAND BANDSTOP 
SAMPLED-DATA FILTER 


As an example of the application of the bilinear z-form to the simula- 
tion of a practical filter, consider the design of a particular bandstop 
filter. The filter is to exhibit at least 75 db loss in a rejection band which 
extends between 2596 cps and 2836 eps. Below 2588 eps and above 2844. 
cps, the loss is to be between 0 and +0.5 db. The sampling rate of the 
discrete filter is to be 10 kc. The complexity or order of the filter is to be 
held to a minimum. Fig. 4 shows a sketch of the amplitude response char- 
acteristics desired of the filter. 

Minimum filter complexity and sharp transition between pass and stop 
bands suggest the use of an elliptic filter® (equiripple) as the basic low- 
pass type. However, before a suitable low-pass structure can be deter- 
mined, the above specified critical frequencies must be prewarped by 


DECIBELS 





2.596 2.836 
FREQUENCY IN KILOCYCLES PER SECOND 


Fig. 4— Desired amplitude response characteristic of a bandstop filter. 
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means of (8). The warped values are: 


fiy = lower cutoff frequency in passband (2588 cps) = 3364.15 cps 
fir = lower cutoff frequency in rejection band (2596 eps) 

= 3381.13 cps 
fur = upper cutoff frequency in rejection band (2836 cps) 

= 3937.54 eps 


fup = upper cutoff frequency in passband (2844 cps) = 3957.84 eps. 


The warped values at the lower band edge require a low-pass filter 
with a transition ratio of 0.93792, while the values at the upper band 
edge require a transition ratio’ of 0.93658. Therefore, to meet the original 
specifications, the larger of the two transition ratios must be chosen. 
Hence specifications required for the basic low-pass elliptic filter are: 


in-band ripple = 0.5 db 
out-of-band minimum attenuation = 75.0 db 
transition ratio = 0.93792. 


Application of elliptic filter design procedure with these specifications 
yields a basic low-pass structure of eleventh order. The poles and zeros 
for the transfer function of this low-pass filter are listed in Table I. The 
low-pass filter has been normalized to have a cutoff frequency of one 
radian per second and amplitude gain of unity at zero frequency. 


TaBLE I—Poues AND ZEROS OF NORMALIZED HLEVENTH-ORDER 
TiLturetic Low-Pass Fitter 








In-band ripple = 0.500 db 
Minimum attenuation = 76.504 db 
Transition ratio = 0.937917 


Gain factor = 0.0011060 


Poles 
—0.0069130 +7 1.0010752 
—0.0257616 +7 0.9756431 
—0.0615122 +7 0.9063786 
—0.1269215 +7 0.7504391 
—0.2142976 +7 0.4483675 
—0.2611853 


Zeros 
sj 1.0695414 
+7 1.1009005 
bj 1.1946271 
-t7 1.4652816 
+7 2.5031313 
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The desired bandstop filter is obtained next by applying the low-pass- 
to-bandstop transformation, given in (11), to the normalized elliptic 
low-pass filter. The cutoff frequencies used are the warped values ob- 
tained above for fip (8364.15 cps) and fx» (8957.84 eps). The resulting 
bandstop filter is then transformed by the bilinear z form to yield the 
required sampled-data filter. Table IT lists the coefficients of the result- 
ing sampled-data filter needed for parallel realization of the form shown 
in Fig. 3. Fig. 5 shows the frequency response characteristics of this 
sampled-data filter. It is seen that the original filter specifications are 
met by the sampled-data filter. For comparison purposes, the standard 
z transform was applied directly to the twenty-second-order continuous 
filter. The frequency response characteristic of this filter is shown in Fig. 
6. It is seen that the standard z transform has yielded an unusable result. 


VI. SUMMARY 


The need for sampled-data filters in wideband simulations of many 
processing systems has led to a synthesis method which overcomes the 
shortcomings of the standard z transform. The method presented con- 
sists of directly transforming a suitable continuous transfer function to a 
sampled-data filter by means of the bilinear z form. For wideband filters 
the method is particularly suited to those filters that exhibit relatively 
constant magnitude-frequency characteristics in successive pass and 
stop bands. Conventional design techniques of continuous filters are used 


TABLE IJ—PartTIiAL-FRACTION EXPANSION COEFFICIENTS FOR 
PARALLEL REALIZATION OF SAMPLED-DATA 
Banpstor FILTER 


Numerator Coefficients 





Denominator Coefficients 


E 

a Ai Ao Bo Bi 
1 0.0001628 0.0008827 0.9987854 0.1106416 
2 —0.0009283 —0.0001764 0.9989898 0.4285348 
3 —0.0024098 —0.0027894 0.9956089 0.1063723 
4 0.0031774 0.0026966 0.9957459 0.4317548 
5 0.0102446 0.0026026 0.9879911 0.0940731 
6 —0.0037799 —0.0112135 0.9883051 0.4414974 
7 —0.0277640 0.0127415 0.9651789 0.0616261 
8 —0.0108027 0.0289421 0.96614388 0.4663508 
9 0.0272223 —0.1163873 0.8694592 —0.0204564 

10 0.1206914 —0.0054765 0.8742300 0.5186036 

11 0.2973946 —0.2973227 0.5283651 0.2074591 
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DECIBELS 











FREQUENCY IN CYCLES PER SECOND 


Fig. 5 — Frequency response characteristics of the sampled-data bandstop 
filter designed by the bilinear z transformation. 


directly in the synthesis procedure of the sampled-data filters. (Thus the 
synthesized filters have frequency characteristics comparable to those of 
continuous filters.) An example has been presented of a filter function 
synthesized by this procedure and easily programmed for a simulation. 
Results obtained from this example demonstrate the usefulness and ac- 
curacy of the bilinear z-form method. 


DECIBELS 





(e) 
2400 2500 2600 2700 2800 2900 3000 3100 
FREQUENCY IN CYCLES PER SECOND 


Fig. 6 — Frequency response characteristics of the sampled-data bandstop 
filter “ésigned by the standard z transformation. 
REFERENCES 


1. Kaiser, J. F., Design Methods for Sampled-Data Filters, Proc. First Allerton 
Conference on Circuit and System Theory, Nov., 1963, Monticello, Illinois. 


1546 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1964 


. Golden, R. M., Digital Computer Simulation of a Sampled-Data Voice-Excited 
Vocoder, J. Acoust, Soc. Am., 35, Sept., 1963, pp. 1358-1366. 

. Wilts, C. H., Principles of Feedback Control, Addison-Wesley, 1960, pp. 197-207. 

. Hamming, R.W., Numerical Methods for Scientists and Engineers, McGraw-Hill, 
New York, 1962, pp. 277-280. 

. Kelly, J. L., Jr., Lochbaum, C., and Vyssotsky, V. A., A Block Diagram Com- 
piler, B.S. T. J., 40, May, 1961, pp. 669-676. 

. Storer, J. E., Passive Network Synthesis, McGraw-Hill, New York, 1957, pp. 
293-296. 


ao co Se bd 


The ALPAK System for Nonnumerical 
Algebra on a Digital Computer — ITI: 


Systems of Linear Equations and a 


Class of Side Relations 


By J. P. HYDE 
(Manuscript received March 6, 1964) 


This is the third and last in a series of papers describing the ALPAK 
system for nonnumerical algebra on a digital computer. The first paper 
is concerned with polynomials in several variables and truncated power 
series with polynomial coefficients. The second paper” is concerned with 
rational functions in several variables and truncated power series with 
rational-function coefficients. The present paper discusses systems of 
linear equations with rational-function coefficients and a certain class of 
side relations. 

The ALPAK system has been programmed within the BE-SYS-4 
monitor system on the IBM 7090 computer, but the language and concepts 
are machine independent. Several practical applications are described in 
Ref. 1. 

This paper ts divided into six sections. The first two assume that 
the reader has no knowledge of computers or computer programming and 
the last four assume that the reader is familiar with basic computer pro- 
gramming and Refs. 1 and 2. Section I is a general description of ALPAK 
and this paper; Section II discusses the different forms in which a linear 
system can occur, including canonical form; Section III describes the 
ALPAK linear system operations for converting these forms; Section IV 
discusses side relations; Section V describes list naming operations; and 
Section VI discusses possible future developments and improvements. 


I. INTRODUCTION 


This is the third and last in a series of papers describing the ALPAK 
system, a programming system for performing routine manipulations 
of algebraic expressions on a digital computer. The system can perform 
the operations of addition, subtraction, multiplication, division, sub- 
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stitution, and differentiation. The first paper! is concerned with poly- 
nomials in several variables and truncated power series with polynomial 
coefficients. The second paper? is concerned with rational functions 
in several variables and truncated power series with rational-function 
coefficients. The present paper describes the ALPAK facilities for manip- 
ulating and solving by Gaussian elimination systems of equations linear 
in certain variables with coefficients which are rational functions of 
other variables. The facilities for handling a certain class of side rela- 
tions are also described. 

The ALPAK system has been programmed within the BE-SYS8-4 
monitor system on the IBM 7090 computer, but the language and con- 
cepts are machine independent. Several practical applications are de- 
scribed in Ref. 1. 

This paper is divided into six sections, of which the first two do not 
presuppose any knowledge of computers or computer programming 
and the last four assume that the reader is familiar with the basic con- 
cepts of computer programming and Refs. 1 and 2. Section I is a general 
description of ALPAK and deals with basic concepts. Section II de- 
scribes the different forms in which a linear system can occur, including 
especially the canonical form of a linear system. Section III discusses 
the ALPAK linear system operations for converting these forms. 

Section IV describes the way in which ALPAK has been programmed 
to simplify a rational function, using a certain class of side relations. 
The most important relations in the allowed class are of the form X° = C 
(C' a rational function independent of X). This includes in particular 
¢ = —lands° = 1 — c where s and ¢ can stand for sin a and cos a, 
respectively. The simplification is done by a special rearrangement of 
the ALPAK format statement and has certain limitations. 

Section V discusses list naming operations, a convenient set of auxili- 
ary operations for handling arguments of ALPAK subroutines which 
are lists (one-dimensional arrays). Finally, Section VI discusses possible 
future developments and improvements. 


1.1 An Example of the ALPAK Language 


The simplicity of handling linear systems by ALPAK is illustrated 
in the process of solving the following system of two linear equations, 
EQ1 and EQ2, in two unknowns, X/1 and X2, with polynomials in a 
as coefficients. 


EQ1: 3aX/ + 2aX2 —1=0 
EQ2: 2aX1 + 5a°X2 —~-3 =0 
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We first extract a coefficient matrix, SYS, for the equations with —1 
and —3 moved to the right side. 














3a 2a 1 
SYS: ; . 
2a 5a 3] 
The matrix is then put into canonical form using Gaussian elimination.” 
1 —6 + 5a 
—4a + 15e 
SYS: “ 
0 oa Coe, 
—4a + 15a? 


The fact that the original coefficient matrix and the canonical form 
matrix both have the name SYS does not imply that they are equal 
but rather that the latter replaces the former physically in the computer. 
The expressions for the unknowns are then extracted from the coefficient 
matrix. 


_ 76+ 5a 
a —4a + 15a? 
7 
oe —4a + 15a?" 


The following program illustrates how these operations are performed by 
ALPAK. 


SYS SYSRES 2;2 

Reserve space in the computer for 
the physical representation of the 
set of system coefficients which will 
be obtained from two linear equa- 
tions in two unknowns and name the 
set SYS. 

SYSFRM — SYS,(HQ1,EQ2) ,=2 
Extract the 2 X 3 coefficient matrix 
from the equations HQ1 and HQ? and 
place it in SYS. The “=2” says 
that there are two unknowns and the 
third column of the matrix is used 
for the terms of the equations which 
are independent of the unknowns. 

SYSPRT SYS 
Print the system coefficients. 
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SYSCFM SYS 
Put the system into canonical form 
(described in next section) using 
Gaussian elimination. If the system 
were triangular, the row selection 
strategy would cause this to be done 
in the obvious way. The canonical 
system retains the same name as the 
original system. 

SYSSLV (X1,X2) SYS 
Fill X17 and X2 with the solutions 
for the unknowns in SYS. The op- 
eration SYSSLV assumes that SYS 
is in canonical form. 

RFNPRT Xi 

RFEFNPRT X2 
Print X1 and X2. 

X1 
X2 

These are names of single cells in 
memory which will be filled in with 
“pointers”* to the physical repre- 
sentations of the solutions in the 
computer. 


The usefulness of the linear system operations was demonstrated in 
a problem from queuing theory, proposed by L. Takaes,f in which a 
truncated power series of 813 terms was involved in forming a system 
of nine linear equations in nine unknowns. One of these unknowns was 
the third moment of a probability distribution. Its numerator had 
200 terms in five variables with maximum degrees 1, 1, 3, 7, and 9 and 
its denominator had 39 terms in two variables with maximum degrees 
7 and 10. 


II. LINEAR SYSTEMS 


In this section are discussed the different forms of linear systems as 
they are dealt with in the ALPAK context. It is important in writing 
ALPAK programs to remember what these forms are. The next section 
discusses the ALPAK subroutines for changing one form to another. 


* See Ref. 2, p. 795. 
t See Ref. 1, pp. 2090-2092. 
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2.1 System of Equations 


A linear system of m equations in n unknowns, 7; , is a set of m ra- 
tional functions, (1), of v variables (v = n), each of which is linear in 
the x; and is implicitly equal to zero. 


. j=l 


Thus for each 2 in (1) the d;; are the coefficients of the x; and together 
with c; may be thought of as n + 1 rational functions with a common 
denominator. 


2.2 System Coefficients 


Consider (1) written in the form: 
Dt; =e; ©6061 Si < m). (2) 
j=1 


The \,; and the c; of (2) shall be referred to as the system coefficients 
of the linear system (1). In ALPAK they form an array of m(n + 1) 
rational functions stored row-wise and forwards. 


2.3 System Canonical Form 


Let 2a, , *** , Ya, be a subset of the unknowns 2, --- , , which we 
shall call the dependent set, and let 2,,,, +++, 2a, be the remaining 
unknowns, which we shall call the independent set. The dependent set 
is said to be valid if r is the rank of the system and if the associated 
columns of system coefficients are linearly independent over the field 
to which they belong. The system 


n 


Xa; + > Nila; = ¢; (1 S a 


j=r+l1 


IIA 


r) (3) 


and its array of system coefficients are both said to be in canonical form 
with respect to such a dependent set.* It can be shown that for any 
linear system and a given valid dependent set, there exists a unique 
canonical form which is obtainable from the original system and which 
is satisfied by the same values of the x,,. One obtains this canonical 
form by Gaussian elimination; 1.e., operating on the system by suitably 
chosen row operations and column interchanges. t When there is a choice 
of row interchange, the row with the most zero coefficients is selected 
to minimize the work involved. If, in the derived canonical form, r < m, 


* The dependent set in (8) is clearly valid. 
+ See Refs. 3 and 4. 


1552 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1964 


the last m — r rows should be of the form 0 = 0. If they are not, the 
system is said to be znconsistent. If r < n, the system is said to be singu- 
lar. 


2.4 System Solution 


The solution of a linear system is a set of r rational functions, (4), 
of v variables (v 2 n — r), each of which is linear in the 2; (r + 1 
<j Sn) and is implicitly equal to 2, . 


n 


Ya; = Ci —- dy Nia; (lsispr). (4) 
j=rtl 

The solution is easily produced once the system is in canonical form, 

and if the system is nonsingular the solution is of the form 2,, = c¢; 

(lsSisSn). 


III. LINEAR SYSTEM OPERATIONS 
3.1 General Remarks 


In this section are discussed the ALPAK subroutines for convert- 
ing the different linear system forms discussed in Section II. The name 
of a set of system coefficients must be defined by operations SYSNAM 
or SYSRES if it is to be used in any other operations. This name is the 
BSS address of a three-word system heading in which are stored the 
five system parameters. These parameters are the BSS address of the 
system coefficients, the number of equations, the number of unknowns, 
an ALPAK format address, and the number of leading variables in 
this format of which the equations are independent. They are set at 
assembly time by operations SYSNAM and SYSRES or at run time 
by operations SYSSET and SYSMPR. 

The m(n + 1) system coefficient pointers are stored row-wise and for- 
wards, and a block of n + 1 cells must immediately follow to be used 
by ALPAK as work space. In the ALPAK format statement of the sys- 
tem equations, the n unknowns must have consecutive variable numbers 
k + 1 through k + n (k 2 0). If k > O, the system equations must 
be independent of the first k variables, and thus the system coefficients 
are independent of the first k ++ » variables. The system parameter 
fmt is normally this ALPAK format statement and is referenced in 
any system operations involving the names of the unknowns. If it is 
not supplied by SYSNAM, SYSRES, SYSFRM, or SYSSET, all such 
operations must refer to variables by number (VARTYP NUM or 
VARTYP NUMs). 

Those arguments of operations SYSFRM, SYSCFM, and SYSSLV 
which are lists are specified according to the conventions established 
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in Section V. System parameters and system names are not indexable, 
but the addresses where they are stored or to be stored are. As in other 
ALPAK operations, index registers are preserved with the exception of 
index register four. 


3.2 Notational Conventions 


The following conventions of notation are used in descriptions of 
instructions. Upper-case letters are used for operation codes (including 
macro names) and for any parameters which must appear exactly as 
shown. Dummy parameters are indicated by lower-case letters. A dummy 
parameter usually stands for the symbolic address of a cell or block of 
cells in the program where the argument is stored. Those dummy param- 
eters which are the arguments themselves are in boldface. Finally, 
optional parameters are enclosed in brackets, and parameters which 
usually have subarguments are enclosed in parentheses. All integer 
arguments are decimal. By this notation, then, the instructional de- 
scription 


sys SYSRES m,n,[fmt],[k] 


specifies certain properties and restrictions about the arguments of the 
following call: 


COEFF SYSRES 9,9, INDEP 


Thus, only SYSRES must appear exactly as shown and all other pa- 
rameters are dummies with the third one omitted, as it is optional. 
The number of equations is nine, but the number of leading variables 
of which the equations are independent is in the cell whose symbolic 
address is INDEP. 


3.3 Linear System Operations 


sys SYSNAM _ bss,m,n,[fmt],[k] name (a) 
sys SYSRES m,n,[fmt],{k] reserve (b) 
SYSPRT - sys print (c) 
SYSFRM _ sys,(listr),n,[k] form (d) 
SYSCFM  sys,[(listv)],[inc],[ids] canonical form (e) 
SYSSLV (listr) ,sys,|[(listv) ] solve (f) 
SYSOBT __ [(abss)],[(m)],{(n)],[(afmt)], obtain parameters (g) 
[(k)],sys 
SYSSET sys,[abss],{m],[n],[afmt],[k] set parameters (h) 


SYSMPR - sys,[(op oper)], [(op oper)] modify parameters (i) 
[(op oper)], [(op oper)], 
[(op oper)] 


1554 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1964 


sys = name of system (symbolic address of heading) 
bss = BSS address of the array of system coefficients 
abss = address where bss is or is to be stored 
m = the number of equations in the system 
n = the number of unknowns in the system 
fmt = the address of the system’s ALPAK format statement 
afmt = address where fmt is or is to be stored 
k = the number of leading variables in this format statement 
of which the system equations are independent 
listr = list of rational functions (see Section V) 
listv = list of variables (specified in the manner indicated by the 
last previous VARTYP declaration — see Section V) 
(op oper) = a 7090-94 machine operation and an operand separated by 
a blank 
inc = inconsistency return 
ids = invalid dependent set return. 


3.4 Descriptions 
(a) sys SYSNAM bss,m,n,|[fmt],[k] 


Declare a block of length (m + 1) (n + 1) starting at bss to be a set of 
linear system coefficients and work space, and name it sys by reserving 
remotely a three-word system heading. This heading is filled in with 
bss, m,n, fmt, and k. If fmt and/or k is omitted, the corresponding fields 
in the system heading are filled in with zeros. 


(b) sys SYSRES m,n,[fmt],[k] 


Reserve remotely a block of length (m + 1) (x + 1) for a set of system 
coefficients and work space, and name the set sys by reserving remotely 
a three-word system heading as in SYSNAM. sys is to be filled in at 
run time (e.g., by SYSFRM). 


(c) SYSPRT Sys 
Print the set sys of system coefficients. 
(d) SYSFRM sys, (listr) ,n,[k] 


Replace sys by the set of system coefficients formed from the set listr 
of system equations and remove the common factors between the 
coefficients of any given equation (listr is destroyed). The contents of 
n and k and the number of rational functions in listr together with 
their format are copied into the heading of sys. If k is not supplied, it 
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is assumed to be zero. If SYSFRM is not used to fill in sys, the system 
parameters must be filled in with operations (a), (b), or (h). 


(e) SYSCFM sys, |[(listv) ], [ine], [ids] 


Replace the set sys of system coefficients by its associated canonical 
set, using Gaussian elimination. Jstv is a list of unknowns (specified in 
the manner indicated by the last previous VARTYP declaration) to 
be included in a valid dependent set. If léstv is not supplied, the list 
is assumed to be empty. If sys is found to be inconsistent, control will 
be transferred to inc (or to the REMARK subroutine if inc is not sup- 
plied) and sys will have a canonical form with an inconsistency. If the 
set of unknowns in J/zstv cannot be included in a valid dependent set, 
control will be transferred to ids (or to the REMARK subroutine if 
ids is not supplied) and sys will have a canonical form with some subset 
of listv in the dependent set. At inc or ds it is possible to call SYSSLYV, 
SYSPRT, or to go to some other part of the program. 


(f) SYSSLV (listr) ,sys,[(listv) ] 


Replace listr (whose length must not be less than that of listv) by the 
solutions for the list of unknowns listv (specified in the manner indicated 
by the last previous VARTYP declaration). sys is assumed to be in 
canonical form. If lzstv is not supplied, all the unknowns in the dependent 
set are solved for in the order in which they were at the start of SYSCFM. 


(g) SYSOBT [(abss)],[(m)],[(n)],[(afmt)],[(k)],sys 


Obtain the system coefficient parameters of the system whose name is 
sys. Each optional argument is a memory location in whose address 
field the parameter is to be stored. Thus the parameter bss is stored in 
the location abss specified by SYSOBT, etc. Each optional argument 
may actually be several arguments, and if an argument is an integer 
equal to seven or less, it refers to an index register. 


(h) SYSSET sys,[abss],{m],[n],{afmt],[k] 


Set the system coefficient parameters of the system whose name is 
sys from the locations specified by the bracketed arguments. Thus the 
parameter bss is set to the contents of the location abss specified by 
SYSSET, etc. 


(i) SYSMPR _ sys,[(op oper)],[(op oper)] 
[(op oper)], [op oper)], [Cop oper)] 
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Modify the system coefficient parameters of the system whose name is 
sys using the 7090-94 machine operations op with operands oper. Thus, 
the parameter bss is modified by the first operation and operand, m 
is modified by the second, n by the third, fmt by the fourth, and k by the 
fifth. Each operation and operand may be different. Typically, the 
operation is ADD or SUB and the operand is the address of some inere- 
ment or decrement. 


IV. SIDE RELATIONS 


4.1 General Remarks 


The ALPAK programmer may find that expressions involving radi- 
cals occur in his problem. A radical can be handled by assigning it a 
variable name and writing the rational functions using this name. 
Thus in the polynomial a + 2a+/3, we let X = +/3 and the expression 
becomes a + 2aX. The problem is that in the outputs of arithmetic 
operations involving such rational functions, X can have an exponent 
greater than one and the fact that X’ = 3, X° = 3X, X* = 9, --- will 
be ignored. The implicit equation X° = 3 is called a side relation of 
degree two on X. A subroutine is provided for simplifying rational 
functions using side relations of the general form 

x" =C (j an integer = 1) : 
(Ca rational function independent of X). e) 
This category includes especially 7 = —1 and s° = 1 — c’ where s 
and c can stand for sin a and cos a, respectively. 


4.2 Limitations 


Many limitations exist in the present handling of side relations. In 
the relation X” = C,n must be a power of two and X a single variable. 
Dependencies between relations are not observed; i.e., R? = 2and S’ = 3 
and 7” = 6 will not result in the implicit relation of T = +RS. More- 
over, relations are not handled automatically by the lowest-level sub- 
routines, thus causing exponents to grow unnecessarily until simplifica- 
tion is done at main program level. A more sophisticated version of 
ALPAK would prevent this by including the relations as part of the 
format statement. To repair these limitations would require a great 
deal of extra programming, and it turns out in practice that these limita- 
tions do not usually matter. The general problem of dependencies is 
especially difficult, as it involves algebraic extensions of the field of 
rational functions of several variables. 
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4.3 Implementation 


The simplification of a rational function, RF, by a side relation 
X” = C is accomplished with the aid of a specially constructed tempo- 
rary ALPAK format statement. The temporary format is the same as 
the original one except that the exponent field of X is split into two parts. 
The right 7 bits are assigned the name X and the remaining left-hand 
bits form a temporary exponent field which is assigned any name and 
stands effectively for X”. The rational function C is then substituted 
for the temporary variable by the call SEIDREL. If several side relations 
are defined on several variables, then a single temporary format state- 
ment can be used to split up these variables. There will then be a list 
of rational functions to be substituted for the temporary variables by a 
single call to SIDREL (see Section 3.2). 


SIDREL rf, (listv), (listr) ,tfmt 


rf = rational function to be simplified 
listv = list of temporary variables (specified in the manner indicated 
by the last previous VARTYP declaration — see Section V) 
listr = list of rational functions to be substituted for these variables 
and specified in the same order (see Section V) 
tfmt = address of temporary format. 


The format of rf after simplification is the format of the items in lstr, 
or if none of these items has a format, the format of 7f is unaltered. 


4.4 Hxrample 


Suppose it is desired to simplify the function RF using the side re- 
lation J? = —1. This is done by the following program. 
FMT POLCVF (X,5, Y,5,1,5,Z,21) 
Permanent format. 
TEMT POLCVF (X,5,Y,5,1ISQ,4,1,1,Z,21) 
Temporary format with J split up 
into JSQ with four bits and J with 
one bit. 
VARTYP NAM 
POLSTC MON,=-1 
SIDREL RF,ISQ,MON,TEMT 


RE 
MON 
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Testing equality of rational functions R and S which have been 
simplified by a side relation should always be done by subtracting and 
testing for zero as follows: 


RENSUB TEMP,RS 
SIDREL TEMP,ISQ,MON,TFEFMT 
: The side relation applied to R and 
S is applied to TEMP. 
RFNZET TEMP 
Test if TEMP is zero. 


This procedure will recognize that the expressions (1 + 7)/(1 — 7) and 
2 are equal. 


V. LIST NAMING OPERATIONS 


5.1 General Remarks 


Whenever an ALPAK subroutine argument is a list, the list may be 
specified either by actually listing the contents; e.g., 


SYSSLV (P,Q,R),SYS,(X,Y,Z) 
or by name; e.g., 
SYSSLV (LISTP,«), SYS,(LISTV, «) 


Here the asterisk indicates that the list has been specified by name. 
Both methods may be used within the same command; ¢.g., 


SYSSLV (P,Q,R)JSYS(LISTV, +) 


This section describes a set of operations LSTNAM, LSTMAK, and 
LSTRES for assigning names to lists and blocks of storage, thus enabling 
one subsequently to call them by these names in the appropriate sub- 
routines. The operations LSTOBT, LSTSET, and LSTMPR serve as 
auxiliary operations. The facilities are especially useful whenever the 
items of the list are to be filled in at run time or whenever the items 
do not form a contiguous block in core. A list has two parameters, which 
are its BES address and its length. These can be set at assembly time 
by LSTNAM, LSTMAK, and LSTRES or changed at run time by 
LSTSET and LSTMPR (see Section 3.2). List parameters and list 
names are not indexable, but the addresses where they are stored or to 
be stored are. Each item in the specified contents of a list may be tagged. 
Index registers are preserved with the exception of index register four. 
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5.2 List Naming Operations 


ttl LSTNAM _ bes,Ing,[VAR] name (a) 
ttl LSTMAK _ (items),[VAR] make (b) 
ttl LSTRES Ing, [VAR] reserve  (c) 
LSTOBT [(abes) ],[Ung) ],ttl obtain (d) 
LSTSET ttl,[abes], [Ing] set (e) 


LSTMPR _ ttl,[(op oper)],[(op oper)] modify (f) 


ttl = name for list 
bes = BES address of list 
abes = address where “‘bes”’ is or is to be stored 
Ing = length of list 
items = contents of list 
(op oper) = 7090-94 machine operation and an operand separated by 
a blank. 


5.3 Descriptions 
(a) ttl LSTNAM bes,Ing, [VAR] 


Declare a set of items in a contiguous block of length Ing to be a list 
whose BES address is bes, name it ttl, and set the list parameters to 
bes and Ing. If VAR is present, the list is assumed to consist of variables 
(specified in the manner indicated by the last previous VARTYP decla- 
ration). If VAR is not supplied, the list is assumed to consist of rational 
functions, polynomials, ete. (1.e., of symbolic addresses of pointers. ) 


(b) ttl LSTMAK _ (items),[VAR] 


Declare the set whose elements are the subarguments in ztems to be a 
list, name it éé/, and set the list parameters accordingly. VAR is as de- 
scribed in LSTNAM. The items need not be in a contiguous block as 
in LSTNAM. 


(c) ttl LSTRES Ing [VAR] 


Reserve remotely a block of length lng for a list, name it tt, and set the 
list parameters to the BES address of the block and Ing. VAR is as 
described in LSTNAM. The list is to be filled in at run time (e.g., by 
SYSSLYV). 


(d) LSTOBT [(abes) ],[(ing) ], ttl 


Store the BES address of the list whose name is ¢él in location abes 
and store its length in Ing. The bracketed arguments may actually 
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consist of several subarguments, and if an argument is an integer equal 
to seven or less, it refers to an index register. 


(e) LSTSET ttl,[abes], [Ing] 


Set the BES address of the list whose name is ttl to the contents of abes 
and set its length to the contents of Ing. 


(f) LSTMPR ttl,[(op oper)],{(op oper)] 


Modify the BES address of the list whose name is ¢él using the 7090-94 
machine operation op with operand oper specified by the first bracketed 
argument. Modify the length of the list in a similar manner as indicated 
by the second bracketed argument. Typically, the operation is ADD or 
SUB and the operand is the address of some increment or decrement. 


5.4 Example 


The following example shows how list naming can be used to good 
advantage. We are given polynomials (Ai, ---, Ay; y S 15), a set 
of variable names (m,, --+, my; y S 15), and polynomials (F,, --- , 
F,;y = 15). It is desired to form a set of polynomials (G,, --- , Gy; 
y < 15) in the following way, where m;: A; means A; is substituted for 
Mi 


G, 
Ge 


Fi(m :A1) 
Fo(m :Ay M2:A2) 


Il 


Gy = Fy(mz:Ay ,me:A2g, +++ , My:A,). 


Assume that the F’,’s, A,’s, /,’s, and G;’s are stored forwards in blocks 
whose BES addresses are F, A, M and G respectively and that the 
parameter y is in location Y. The following program will perform the 
substitution. 


POLS LSTNAM A,15 
VARS LSTNAM M15 
Define the lists thus setting the list 
parameters to (A,15) and (M,15) 
LSTMPR POLS(SUB Y) (SUB =15) 
LSTMPR VARS(SUB Y) (SUB =15) 
Initialize the list parameters to (A — 
y,0) and (M — y,0). 
LXA ¥aA 
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LOOP LSTMPR POLS(ADD =1) (ADD =1) 
LSTMPR VARS(ADD =1) (ADD =1) 
POLSST (G,1) (FI) (POLS,*) (VARS, +) 
TIX LOOP,1,1 
Increment the list parameters by 
one at each repetition of the above 


loop. 
F BES 15 
A BES 15 
M BES 15 
G BES 15 
Y 


VI. OUTLOOK 


Our experience has shown us that the present handling of linear 
systems has its limitations. Large linear systems are always difficult 
to put into canonical form, and even a relatively simple set of system 
coefficients can grow quite rapidly throughout the course of SYSCFM 
and cause some form of overflow. This growth becomes coupled with 
the growth produced by the greatest common divisor algorithm,* 
thus making the inadequacies of the latter most apparent. The 
success or failure of SYSCFM depends less on the dimensions of the 
system and more on the internal structure and size of the individual 
coefficients. Moreover, it is very difficult to tell by looking at the input 
array whether the structure and size at a later stage of the reduction 
will cause trouble. This difficulty is illustrated in that SYSCFM suc- 
ceeded in reducing a 9 X 9 array with large, apparently complex, en- 
tries, Tf but failed in a related queuing theory problem to reduce a 10 X 10 
array whose entries averaged only two or three terms. It can at least 
be said that there will be no GCD problems if the original array con- 
sists of all rational numbers. 

The subroutine SYSCFM is perhaps too comprehensive. A series of 
orders which would enable one to perform the Gaussian elimination 
method a step at a time, leaving the choice of row and column permuta- 
tions completely up to the user, might be useful. SYSPRT could then be 
called at any time during the reduction. A routine for evaluating de- 
terminants, if available, would enable the solution of nonsingular systems 
by Cramer’s method as an alternative. 


* See Ref. 2, pp. 791-794. 
t See Ref. 1, pp. 2090-2092. 
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The growth problem could be reduced by allowing multiple precision 
polynomial coefficients and by allowing a polynomial to be represented 
as a product of polynomials (not necessarily irreducible). Thus one could 
compute the GCD as a product of simpler GCD’s. To do this would 
require the ability to have a data structure hierarchy in the data buffer 
more complicated than that of a rational function.* This ability would 
also enable a linear system itself to be such a data structure rather 
than an array in the main program. 

A new version of ALPAK (to be called ALPAKB) is now being de- 
veloped. Its foundation is a programming system called OEDIPUS 
(Operating Environment with Dynamic storage allocation, Input-out- 
put, Public push down list, Unhurried diagnostics, and Symbolic snaps) 
which provides for the dynamic storage allocation of such data struc- 
tures, among other things. ALPAKB will also include multiple precision 
integer arithmetic which will handle polynomial coefficient overflow. 
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A Technique for Measuring Small 
Optical Loss Using an Oscillating 


Spherical Mirror Interferometer 


By A. J. RACK and M. R. BIAZZO 
(Manuscript received March 19, 1964) 


The measurement of very small optical losses (the order of a few per cent) 
by conventional methods becomes very difficult because of the extreme ac- 
curacy required. This article shows that both high mirror reflectances 
and low transmission losses can be readily measured using an oscillating 
mirror interferometer as a frequency spectrum analyzer. The theory devel- 
oped shows that when this type of interferometer is excited by a continuous 
gaseous laser, the total optical loss is proportional to the frequency resolution 
or the finesse. The theory also shows that the first-order velocity effect pro- 
duced by having the mirror move at a velocity of one foot per hour can be 
large if the total optical loss is about 0.25 per cent. For the velocities and 
optical losses we have encountered so far in our measurement system, the 
first-order mirror velocity effect can be neglected. The range of reflectance of 
mirrors we have measured is from 94 to 99.5 per cent, and the measurements 
for the optical transmission loss range from 0.2 to 8 per cent. The accuracy 
to which a 1 per cent loss can be repeated is 1.0 + 0.1 per cent. It was found 
that the transmission loss through an optical grade of fused quartz (Homosil) 
at 6328 A is about 1 db per meter, and that for Plexiglas II 1s about 2 db 
per meter. 


I. INTRODUCTION 


In developing a long-distance optical communication system employ- 
ing a large number of components, it is essential to be able to measure 
accurately the optical transmission loss of each component. These 
components may include mirrors with reflectances in the order of 99 per 
cent, Brewster angle output windows, various lenses, and other passive 
elements with optical transmission losses of 1 per cent or less. For such 
small losses, the conventional measuring techniques become increasingly 
difficult because of the extreme accuracy required. 


1563 
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In conventional measuring systems, the loss (or reflectance) is caleu- 
lated by comparing the electrical output of a photodetector for two dif- 
ferent optical conditions, first with the unknown in the system, and then 
with the unknown removed. As the magnitude of the optical loss de- 
creases, this comparison becomes more and more inaccurate, as it 
requires the measurement of a small difference between two relatively 
large photodetector outputs. In these measuring systems, a number of 
methods have been developed to minimize these errors, which are prin- 
cipally optical.! However, it is believed that measuring techniques to be 
described below will more readily measure very small optical losses. 

The proposed method uses a frequency spectrum analyzer at optical 
frequencies. Such an analyzer can be obtained by using a Fabry-Perot 
type of interferometer as a transmission element between an optical 
source and a photodetector.?? When the mirror separation of the inter- 
ferometer is varied periodically by moving one of the mirrors linearly, 
a large photocell output will be obtained whenever the optical cavity is 
in resonance at any frequency that may be present in the optical source. 
If the photodetector output is observed on an oscilloscope whose sweep 
is synchronized with the mirror drive, the scope will display the energy 
distribution of an optical source as a function of frequency. If this type 
of Fabry-Perot interferometer is illuminated by a continuous laser with 
its extremely narrow line output (one or two cycles wide), the linewidth 
of the pattern displayed on the scope is determined by the optical losses 
in the cavity itself, and by the velocity of the moving mirror. If the ex- 
cursion of the moving mirror is several optical wavelengths, the scope 
pattern will repeat several times during a single sweep trace. That is, as 
the mirror separation increases, the mth harmonic of the cavity becomes 
resonant with the source; a short time later, the (m + 1)th harmonic is 
resonant with the same optical frequency, then the (m + 2)th, and so on. 
At each resonant point, there will be several output scope pulses, since 
the laser usually has an output at more than one frequency. The ability 
of any interferometer to separate or resolve two adjacent optical fre- 
quencies is determined by the finesse of the system. In the moving-mirror 
interferometer, the finesse is equal to the ratio of the fundamental pulse 
group spacing to the half-power-height width of any pulse. It will be 
shown below that the total power loss of the Fabry-Perot cavity, ex- 
pressed as a ratio, is equal to m divided by the finesse. 

This interferometer method of measuring small optical losses has been 
suggested on several occasions,‘ but it is believed that this is the first 
time such a system has been so fully developed. Since this system of 
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measurements requires an optical source with an output linewidth very 
narrow compared to that of the optical cavity, loss measurements can- 
not be made over a continuous range of wavelengths, but only at the 
discrete wavelengths at which ew lasers have been developed. Since some 
light must be transmitted through the cavity, the reflectance can be 
measured only for those mirrors not coated with an opaque reflecting 
surface, such as the multiple-layer dielectric coated mirror. 

The interferometer loss measuring technique is very sensitive to build- 
ing and floor vibrations, and to air currents. The reason for this is as 
follows: if the separation between the two mirrors in the cavity is 
changed by one half wavelength, the resonant frequency of the inter- 
ferometer is changed by the natural frequency of the cavity, which in 
our system is about 1 kme, whereas the bandwidth of the cavity for 1 
per cent optical loss is only about 3.0 me. Thus, even very small random 
variations in the mirror spacing in both the laser and in the cavity, pro- 
duced by either vibrations or air currents, will cause the output pulse 
pattern displayed on the scope to have large random time position 
variation. 


II. THEORY 


The relation for which the mirror reflectance and other optical losses 
can be calculated from measurable quantities is derived in the Appendix, 
and will be given briefly here. The effects of the velocity of the moving 
mirror were included in the Appendix. 

Let 


R,, Re. = power reflectance of the two mirrors expressed as a ratio 
g = power loss per single pass through any material within the 
cavity, also expressed as a ratio 

T.. = fundamental pulse group spacing 

T, = half-power output pulse width 
v = velocity of moving mirror 

dy = mirror spacing 
é = time 

X = optical wavelength in free space 
c = velocity of light in free space 


Andy v 
vA ¢ 


x = g(RiR2)’ 


Bo = 
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_ dnt 
r 


go = [g(1 — Ri)(1 — R2)}’. 


An exceedingly good approximation for the time response of the in- 
terferometer appropriate for computer calculation is given by (5) of 
the Appendix: 


ie) 2 ce) 2 
Ip ba gor” cos (an + aun’) | + bP gow” sin (an + aun) | . (5) 


Since z is very nearly unity, a large number of terms must be taken in 
the series. For a maximum error of ¢ in stopping the series after the first 
N — 1 terms, we have 


Il 


, log (2/e) 
Ne (6) 


Tor example 
e = 1/1000 and zx = 0.9975, N = 3040. 


The values of J; shown in Fig. 1 were calculated for the following three 


\ B =0 
Bo =10" a ‘ one o= 
- agate IN Ve x= 0.9975 
By =1078 
x = 0.9975 


1.0 


























Fia. 1 — Interferometer time response as affected by mirror velocity. 
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conditions: (a) 6) = 10°, and a = 0.9975, (b) Bo = 0, « = 0.9975, and 
(c) Bo = 10-° and x = 0.995. The curves given in Fig. 1 show that the 
first-order mirror velocity effect is to decrease the maximum response, 
increase the half-power-height pulse width, and make the response 
unsymmetrical about the maximum response. 

From a number of computed values for (5), of which only a few are 
given in "ig. 1, it can be shown that the first-order effect of the moving 
mirror’s velocity is to increase the half-power pulse width according to 
the relation (see Appendix) 


7 | + 13.8 | (7) 
Tp o~ . (i oa v)4 . 
This relation holds only when 7’, is within a few per cent of 7’, . For 
larger values, the computed results are less than those given in (7). 
Thus, in order to have the width increase by less than 1 per cent, 


Boe 1 
Cig)? = 25° 
This relation can readily be satisfied unless the total loss becomes ex- 
tremely small. In our measurements, the mirror spacing is varied about 
one micron at a 20-cycle frequency. The mirror spacing is about 15 em 
and the wavelength is 6328 A. Then, if « = 0.995 (the largest we have 
measured ), 


v = sty cm/sec & 0.4 ft/hour 
and 


Bo 04x 10° i 


(l—a)? 25x10 62.5° 





Hence, for optical losses of 0.5 per cent or greater, we can neglect the 
first-order mirror velocity effects. 

If mirror velocity effects can be neglected, then the system response 
given by (10) and (11) of the Appendix can be used. The relation be- 
tween the optical loss and the finesse of the system is 


g(RiRy)? = 1 — (eT ,/T.) +4 (aT p/Te) + (11) 


where 7',/T', is defined as the finesse. This equation points out the po- 
tential accuracy with which small losses can be measured. The inter- 
ferometer method actually measures how much the combined loss, 
g(RiR2)', differs from unity. Obviously, the smaller this difference, the 
greater can be the experimental errors to obtain a fixed accuracy in 


1568 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1964 


g(RiR2)*. For a value of g(RiR2)* near 0.99, a 10 per cent error made in 
the measurement of 7/7. would give an error of only one part in a 
thousand in the value of g(RiR2)*. 

To obtain a good signal-to-noise ratio in the electrical output of the 
photodetector, it is important to obtain a maximum amount of light 
transmission through the interferometer. From (10), the fraction of the 
incident light transmitted through the cavity is given by (see Appendix) 


7 [1 — g(RiR2)i}? 


Equation (10) shows that for g = 1 and R, = R,, Im is unity for any 
value of reflectance, but if Ri ~ Re, Im will be less than unity. For ex- 
ample, if Ri = 0.995, R. = 0.97 and g = 1, then 7, = 0.49. The remain- 
ing 51 per cent of the light is reflected back towards the source. Therefore 
the two mirror reflectances should be identical for maximum transmis- 
sion. For measurements of transmission losses in the cavity, there is 
some advantage to be obtained by not having the mirror reflectances 
too great. As an example, for g = 0.97 and R; = Ry = 0.99, Im = 0.062. 
For the same loss and R; = R, = 0.97, In = 0.25. In the second case, 
however, there is a greater chance of making an error in measuring g 
since it is a smaller fraction of the total loss. 

The shape of the output pulse expressed asa function of time is given 
by (12) of the Appendix 


(10) 


Lm 
1+ 42/7; ° 


The frequency spectrum of this time pulse is 


I(t) (12) 


Fo) = 2 Pylm exp (- Plel). 


Now, the value of T’, is a function of the velocity of the moving mirror 
and hence, subject to the limitations on velocity discussed above, may 
be made as large or small as is desired. In our laboratory, the motion of 
the moving mirror was so selected that for a 1 per cent total cavity loss, 
the pulse width is about 30 usec, and the frequency spectrum is down to 
1 per cent of its low-frequency value at about 50 kc. The required band- 
width of the photodetector and its associated electrical circuits, including 
the viewing oscilloscope, is increased by a decrease in the total optical 
loss, as this decreases the pulse width, 7’, . In order that the system be 
capable of measuring optical losses as small as 0.25 per cent, the over-all 
bandwidth of the electrical components should be at least 200 ke. 

The above theory was developed for the assumptions that the incident 
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beam of monochromatic light was collimated and that the plane surface 
mirrors were infinite in size. For finite-size mirrors with plane or spherical 
surfaces, and for a finite-size input light beam diameter, the electromag- 
netic energy inside the interferometer can be described by the familiar 
TEM modes.*:*” For a given input light beam condition and for a given 
set of mirrors, the energy within the cavity can be characterized by 
selecting the appropriate amplitudes of the TEM modes. If the input 
beam spot size is too large or too small, a large portion of the input en- 
ergy will be found in higher transverse modes of quite large order. If the 
laser is adjusted to operate in only the TEMoo, mode, then it has been 
shown’ that the light energy will be principally in the fundamental 
TEMoog mode in the cavity, if the spot size and the surfaces of constant 
phases of the input beam are both equal to those for the TEM o,, reso- 
nant cavity mode. As is usual in matching problems, these conditions are 
not too critical. 

When the measuring interferometer has spherical mirrors at nonconfocal 
spacing, an incident light ray at a small angle off the system axis will 
produce repeated reflections, which in general will trace an ellipse on the 
mirrors.? Under special conditions, the points of reflection lie on a circle 
and are displaced by some angle after every round trip. When this angle 
is a multiple of 27, the rays will exactly retrace their paths, and the trace 
of reflections on a mirror will break up in a number of separate and 
equally spaced dots. Under these reentrant conditions, the cavity will 
become resonant not only at a multiple of the fundamental cavity fre- 
quency, but also at multiples of a much lower frequency which is not 
quite a subharmonic of the fundamental cavity frequency. For these 
conditions, the oscilloscope pattern of the photocell output will show, in 
addition to the main response, a number of smaller equally spaced pulses. 
Since the response of one of the ‘‘off-axis’? modes can coincide (or nearly 
coincide) with the main response, the measurement of the loss under 
these conditions can be considerably in error unless these off-axis mode 
responses are made very small. 

In general, the system should be designed to be nondegenerate. That 
is, the length of the interferometer must be so selected as to avoid any 
possible overlapping of the cavity resonant response of the different 
orders of the TEM modes to any of the several optical frequencies 
present in the laser source. Usually, this is not difficult to accomplish. 


III, MEASUREMENT SYSTEM 


A block diagram of the components in the interferometer measuring 
system is given in Fig. 2. The He-Ne laser has external spherical mirrors 
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Fic. 2 — Schematic diagram of the interferometer measuring system. 


and Brewster angle output windows, and operates at 6328 A. To reduce 
the effects of vibrations, the laser and the cavity structures were made 
very rigid by using a construction similar to that used by Bennett in 
his magnetostrictively tuned laser.!° The interferometer mirrors are 
mounted with suitable tilt controls in 6-inch square steel end blocks, 1 
inch thick. The two blocks are tied rigidly together at each of the four 
corners by 1-inch diameter Invar rods 16 cm long. The interferometer 
cavity, shown in Tig. 3, uses a piezoelectric transducer to vary the mir- 
ror spacing in the cavity. It is a ceramic cylinder 14 inches ID, 13 inches 
OD, and 13 inches long.!! The mirror holder is epoxied to one end of the 
cylinder. The other end is epoxied to a mounting plate which is fastened 
through suitable tilt controls to the steel end block. The laser, inter- 
ferometer, and other optical components were fastened rigidly to a 
heavy steel optical table. This 4 by 8-foot table is supported on six small 
airplane inner tubes encased in heavy canvas covers. For this type of 
support, the natural frequency of the table is about four cycles per 
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Fig. 3 — Photograph of the interferometer apparatus 
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second. To avoid air currents, both the laser and the optical cavity are 
enclosed separately in Plexiglas boxes. 

The isolator between the laser and the cavity was found to be essential 
to prevent interaction between the two optical cavities. It is a circular 
polarizer consisting of a polaroid analyzer and a quarter-wavelength 
plate. This circular polarizer will absorb any light reflected back from 
the interferometer, as the sense of rotation of the circularly polarized 
light is inverted upon reflection and hence will not be transmitted back 
through the polaroid analyzer. For this type of isolator, all loss measure- 
ment must be made with circularly polarized light. 

The photodetector is a standard electron multiplier phototube with 
an S-20 cathode. A bandpass optical filter, 200 A wide, centered at 6300 
A, is placed between the source and the detector to eliminate most of the 
background light. 

A 20-cycle triangular wave shape generator delivers 500 volts pp to 
the piezoelectric mirror drive. For this voltage, the motion of the mirror 
is about one micron, which is about three half-wavelengths of the 6328 
A laser source. The motion of the mirror is parallel and was checked by 
using an alignment telescope with a flat mirror placed in the movable 
mirror holder. The fringe pattern of the alignment telescope reflected 
back from the moving mirror showed no discernible change when 1000 
volts de or ac was applied to the driver. Therefore any mirror tilt varia- 
tion must be less than 5 seconds of are. 


IV. EXPERIMENTAL PROCEDURE 


When spherical mirrors are used in the interferometer, the ray of 
incident light must be ona line passing through the centers of curvature 
of both mirrors. With the aid of a thin optical lens to vary the incident 
angle, the cavity can readily be aligned to minimize the “off-axis” 
modes.?® 

The visible red gaseous laser, which was one meter long, was so ad- 
justed that it oscillated only in the fundamental transverse mode and 
at several longitudinal modes. 

In spite of all the precautions taken to eliminate building vibrations 
and air currents, the output pulse pattern on the output scope shows a 
considerable amount of time position jitter for any one pulse whenever 
the scope sweep speed is increased to be able to measure the half-power 
pulse widths. When the sweep speeds are made 10 usec per cm to view a 
30-usec output pulse, the excursion of the time jitter is about +50 
usec, and the jitter frequency is about two cycles or less. 

To overcome the effects of the time jitter, a photographic method was 


1572 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1964 


developed to determine the pulse width. For a single period of the 20- 
cycle drive on the piezoelectric driver, the time jitter is small. The half 
height of the output pulse was determined by using the following elec- 
tronic circuits. In the amplifier following the photocell, a fast transistor 
switch, operating at a one-megacycle rate, reduces the voltage gain of 
the network periodically by a factor of two (6 db). With this arrange- 
ment, a single photograph shows simultaneously both the full-height and 
the half-height pulses, as shown in Fig. 4. The time position jitter in the 
pulse pattern is small enough that the fundamental pulse group spacing 
can be determined directly from the scope. The measurement of the 
finesse of the cavity does depend upon the accuracy of the various sweep 
rate calibrations. According to the manufacturer of the oscilloscope, these 
sweeps, once calibrated, should remain accurate to several per cent for 
several months. 

In order to obtain repeatable loss measurements, it is Important to 
have the laser operate with a stable mode pattern output. At times, this 
was found to be difficult because adjacent longitudinal modes would 





Fig. 4— (a) Fundamental cavity spacing. Laser source has three output 
frequencies. Sweep rate is 2 msec per cm. (b) Half width of one of the cavity re- 
sponses. Sweep rate is 10 usec per cm. The indicated half width is 34 usec. 
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compete with each other, thereby producing an erratic pulse pattern 
output from the interferometer. This condition was improved by reduc- 
ing the length of the laser to 70 cm to separate the adjacent modes farther 
in frequency. 


V. EXPERIMENTAL RESULTS 


The reflectance was measured for a number of the multiple dielectric 
coated mirrors whose radius of curvature varied from 2 meters to infinity. 
The highest value of reflectance was 0.995 and the lowest was 0.940. 
The average mirror reflectance was about 0.990. The accuracy to which 
the reflectance of 0.990 can be repeated was about +0.001. Since the 
reflectance for an individual mirror was calculated from the three loss 
measurements for three mirrors taken two at a time, the error in the 
individual mirror reflectance was probably twice that of the single 
measurement, or -++0.002 maximum. This leaves something to be de- 
sired. The largest source of error is in determining the half-power- 
height pulse width. The photographic method permits a determination 
of this width to about +5 per cent under ideal conditions. 

The transmission loss through a fused quartz (Homosil) Brewster 
angle window with fairly high-quality surfaces was found to be about 
0.25 per cent. These and all other transmission loss measurements were 
taken with two specimens at opposite Brewster angles in the optical 
cavity, so that the deflection of the light ray passing through the samples 
canceled out. The transmission loss through the quartz was measured by 
comparing the loss of a sandwich of three quartz blanks to that of just 
two windows, where an index of refraction matching liquid was used to 
overcome the surface irregularities at the interfaces in both cases. This 
loss was about 0.2 per cent + 0.05 per cent for a 34 cm optical path length 
in the Homosil. This would give a transmission loss of about 1.0 db per 
meter for high-quality optical Homosil. Using the same technique, the 
transmission loss through cast Plexiglas II, properly annealed, is about 
2 db per meter. 


VI. CONCLUSIONS 


The interferometer method has proven to be capable of measuring 
very small optical losses. It requires a number of special precautions, 
such as a stable mode pattern output from the laser source, a careful and 
correct alignment of the interferometer, all possible reduction of the 
effects of building vibrations, and a large degree of optical isolation be- 
tween the cavity and the laser. This method measures the optical loss at 
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only one small spot in the cavity, and this is measured using circularly 
polarized light. It is believed that the system accuracy can be further 
increased by developing an improved method of measuring the finesse of 
the system. 
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APPENDIX 


The theory for the loss measuring optical cavity using plane mirrors 
is that of the Fabry-Perot interferometer, given many times before,” 
modified to include loss within the cavity, different reflectivities for the 
two mirrors, and the first-order effects of the velocity of the moving 
mirror. The exact theory for the effect of the moving mirror’s velocity 
upon the interferometer response was developed. However, it was found 
that after neglecting some of the higher-order velocity terms, the same 
expression for the first-order velocity effects could be more readily ob- 
tained by assuming the mirror spacing to be fixed and by linearly varying 
the input frequency. Only the simpler theory will be given here. These 
first-order velocity effects for the interferometer, which has a Lorentzian 
frequency response, will be shown to be appreciably different from those 
previously calculated for the Gaussian filter.” 

We assume that the incident light beam is collimated, monochromatic, 
and perpendicular to the mirrors. It is also assumed that the index of 
refraction of the space between the two mirrors is equal to that of free 
space, and that the mirrors have no loss. The list of all symbols and 
definition of all terms used in the following theory are given below: 


a = 2x(Af/At) = angular sweep rate 
a = (—4nvt/d) 

B = half-power bandwidth 

Bo = 2mrv/c 

c = velocity of light in free space 


dy = fixed mirror spacing 

E, = total combined electric field of all the output light rays 
assuming a unit input 

power loss per single pass through any material within the 
cavity expressed as a ratio 


< 
I 
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go = gd — Ri) — Rd] 
I, = total output light intensity assuming unit light input 
Im = maximum output from the interferometer assuming a unit 
light input 
> = free-space wavelength of light source 
m & (2do/r) = large integer 
R,, R. = power reflectance of the two mirrors expressed as a ratio 
t = real time 
T,, = pulse width in time of the output pulse at half peak power 
level 
T. = fundamental pulse group spacing in time 
v = (Ad/At) = velocity of moving mirror 
a = g(RiR2)*. 


Let the instantaneous angular frequency input be 
® = wo — ab. (1) 


Then the instantaneous phase is ¢ = wot — (at’/2). 

If the fixed mirror spacing is dy and the input light ray is normal to the 
mirrors as indicated in Fig. 5, then the total time delay for the nth output 
ray is 


do do 
tT +t = 2n— + — 
c c 
where the delay for the initial ray is 
do 
TM~=—. 
c 
pe ee NOTE: IN PRACTICE, ALL RAYS 
LASER in 0 Al ARE AT RIGHT ANGLES TO MIRRORS 





PHOTO 


_—> 
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Fic. 5 — Light path through interferometer mirrors to output. 
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Thus the phase of the nth output ray is 


| 2ndo a] a | 2nd ay 
Pn = way] tb — See a aed 1 
c Cc 2 Cc Cc 


The voltage for the nth output ray is 








E, = Jox" exp (—Jjen). 


The term independent of n can be neglected since its magnitude is unity. 
Thus the total output voltage is given by 


ae dy d do” 
= Di gor exp {— ee — a(x“) | - jean’ (2) 


The relation between the rate of change of the angular frequency, a, and 
the velocity of the mirror may be found as follows 


a= on Af - 28 2Ad © _ ey (3) 


Now, the velocity of the mirror is so small that in any reasonable length 
of time the variation in the mirror spacing is very small compared to the 
initial spacing. Hence, the exponent of (2) can be written as 


a E 7 a(t a) | + 2ant 2 . = nl2me + alt nb (4) 


where a is small, 


m & 2d)/rX = large integer, 


Il 


and 
Bo = 2xrmv/c. 
Thus from (2) and (4), the total output voltage is given by 


Ly = dX golx exp (—ja)]n exp (—JBon’). 
Now 
Ir = | Er le 
Thus 


e) 2 vs) 2 
lye Pp gox” cos (an + aun’) | + | 3 a" sin (an + pun’) | . (5) 


The above expression is a good approximation to that obtained by the 
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exact theory for v/e < 10°. At the present time, the series in (5) can 
only be summed numerically, since any other approximation results in a 
slowly convergent infinite series. The number of terms required by the 
series may be calculated as follows: 


WE i) 2 N-1 (oe) 2 
*s[oe]-(De+d2| 
n=0 n=N 


go" n=0 


(1 — a)? + Qe%(1 — 2%) + 0 

(ta) 
Thus the ratio of the total error in stopping the series after N — 1 terms 
to the actual value is 


= 





N 2N 
S270 Se: 


The number of terms required is then given by 


_ log (2/e) _ log (2/e) 
~ log (1/z) ~ 1-2 (6) 


Thus if 
e = 1/1000 and zx = 0.9975, 
N = 3040. 


Hence for x © 1 a very large number of terms must be used in the series. 

The series in (5) was computed for a number of values of x, and Bp , 
some of which are given in Fig. 1. From these values, it was found that 
the reduction in the maximum of the output response, J,/Im, and the 
increase in the half-height pulse width, 7.,,/T , produced by the mirror 
velocity can be expressed as 


(Imn/Ip)? = Tn/Tp % {1 + [13.8 Be/(1 — x)*f} (7) 


The above expression matches the computed values only when the first- 
order velocity effects are the order of a few per cent. For higher values, 
the computed results are less than those given by (7). 

The relation given in (7) may be compared to that developed for a 
Gaussian filter which might be used in a spectrum analyzer. It has been 
shown’ that the loss in sensitivity, S/S, and the increase in apparent 
bandwidth, B,,/B, produced by sweeping the frequency in the spectrum 
analyzer is given by 


Q)-B-[iram(eahT 


where the above relation was calculated on a power basis. 
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In the interferometer, the mirror spacing is varied linearly with time. 
Hence the ratio of any two time intervals may be replaced by the ratio 
of their appropriate frequency differences. Thus from (3) and (11), it 
can be readily shown that . 


ee ey ee ag (8a) 


From (8a), (7) becomes 


TaN at; 1 ty 
(7) aa E +18 a 


which is an order of magnitude different from the relation given in (8). 

When the mirror velocity is small enough to be neglected, the output 
from the interferometer is given by the first terms of (5). Thus for 
Bo = 0, 


ips ee eee ee, | eee (9) 
7 1—2ecsat2 (1 — 2)? + 4c sin? (a/2) * 
Now the maximum value of [7 occurs at a = 27k, k = 0,1, 2,3, ---, 
hence 
2 
go _ g(1 — Ri)(1 — Rz) 
Ln = I Max: Soo oN a ee a ee 10 
U2)wex = Gay = Tg RRP a 
Irom these relations, it can readily be shown that for x nearly unity the 
finesse of the system is given by” 
r= Te w/a 


T 1—-sx 





The solution for z is then 








_ ag ~ the a A a) 

This equation relates the optical loss to measurable quantities for the 
pulse response of the interferometer. The time function for the output 
pulse can be obtained from (9), 


i. 


te TE Ge/T SA) 


(12) 


for x close to unity. 
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On the {-Boundedness of Solutions of 
Nonlinear Functional Equations 


By I. W. SANDBERG 
(Manuscript received April 8, 1964) 


Let &y denote the set of N-vector-valued functions of t defined on [0,@ ) 
such that for any real positive number y, the square of the modulus of each 
component of any element is integrable on [0, y], and let Lov(0, ©) de- 
note the subset of &y with the property that the square of the modulus of each 
component of any element is integrable on [0,< ). 

In the study of nonlinear physical systems, attention ts frequently focused 
on the properties of one of the following two types of functional equations 


g=f+KQf 
g=Kf+QOf 


in which K and Q are causal operators, with K linear and Q nonlinear, 
g € &y, and f zs a solution belonging to Ey. Typically, f represents the 
system response and g takes into account both the independent energy 
sources and the initial conditions at t = 0. 

It is often «important to determine conditions under which a physical 
system governed by one of the above equations is stable in the sense that the 
response to an arbitrary set of initial conditions approaches zero (1.e., the 
zero vector) ast > ©. In a great many cases of this type, g belongs to 
Ley(0, 0) and appreaches zero as t > © for all initial conditions, and, in 
addition, tt ts possible to show that af f ¢ Sax(0, © ), then f(t) ~Dast— ~, 

In this paper we attack the stability protlem by deriving conditions under 
which g € Lan (0,0) and f € &y imply that f ¢ Lav(0,0). From an engi- 
neering viewpoint, theassumption that f ¢ &y ts almost invariably a trivial re- 
striction. 

As a specific application of the results, we consider a nonlinear integral 
equation that governs the behavior of a general control system containing 
linear time-invariant elements and an arbitrary finite number of time- 
varying nonlinear elements. Conditions are presented under which every 
solution of this equation belonging to &y in fact belongs to Lax(0,~) and 
approaches zeroast— . 
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I. NOTATION AND DEFINITIONS 


Let M denote an arbitrary matrix. We shall denote by 1’, M*, and 
M™, respectively, the transpose, the complex-conjugate transpose, and 
the inverse of M. The positive square-root of the largest eigenvalue of 
M*M is denoted by A{M}. 

The set of complex measurable N-vector-valued functions of the real 
variable ¢ defined on [0,0)[(—%,)] is denoted by 3Cy(0, )[5Cy 
(— 0,0 )], and 


San(0, &) = {il fe sev(0, 2), [fa < oh. 


In order to be consistent with standard notation, we let £.(0,0) = 
Lan(0,%©) when N = 1. We shall not distinguish between elements of 
3w(0, 0 )[ICw(— 0,0)] that agree almost everywhere on [0,~) 
[(— «,«)]. The range of any operator considered in this article is as- 
sumed to be contained in either 3Cy (0,0) or Hy (— 0,0). 

The inner product of two elements of £2y(0,0),f = (fi, fe, --+, fx)’ 
and g = (g1, 92, °** , gw)’, is denoted by (f,g) and is defined by 


(ia) = [fod 
0 
The norm of f ¢ £2y(0, © ) is denoted by || f || and is defined by 
fll = OY. 


The norm of a linear operator T defined on £.y(0,%) is denoted by 
! i y € (0,2), and define f, by 
fut) =fO forte [0,y] 
=0 fort>y 
for any f € 3Cy(0, ©), and let 
Sv = {f|feHnv(0,~), fy e Lov(0,0) for0 <y < ~}. 


The set of real vector-valued functions is denoted by ®, and I and 
ly, respectively, denote the identity operator on L2y(0,) and the 
identity matrix of order N. 

With A an arbitrary measurable N X N matrix-valued function of ¢ 
with elements {@nm} defined on [0,), let Kpw (p = 1,2) denote 


(ALL Leone) Pat < (nm = 1,2, ++ 8h. 
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Let ¥[f(t),é] denote 
(val fit) 4], wal fo(t),t], ara Ywlfw(t),t])’; a ean 5Cw(0, 0) 


where yi(w,t), wo(w,t), «+: , ww(w,t) are real-valued functions of the 
real variables w and ¢t for -« <w< © and0 St < © such that 
(2) wn(0,t) = O forte (0,0) andn = 1,2,---,N 
(77) there exist real numbers a and 8 with the property that 


aoe Yn(w, t) 
_ w 


IA 


B (n = 1,2,---,N) 


for te [0,) and all real w + O. 

(272) y,[w(t),t](n = 1, 2, --- , N) is a measurable function of ¢ when- 
ever w(t) is measurable. 

The symbol s denotes a scalar complex variable with o = Refs] and 
w = ImfJs]. 

We shall say that a (not necessarily linear) operator T with domain 
O(T) C Ky(0,) is causal if an only if for an arbitrary 6 > 0, 


(Tf)(¢) = (Tg)(t) a.e. on (0,6) 
whenever f,g € O(T) and f(t) = g(t) a.e. on (0,6). 


Il. INTRODUCTION 


In the study of nonlinear physical systems, attention is frequently 
focused on the properties of one of the following two types of functional 
equations 


g=f+KQ/ (1) 
g = Kf+Of (2) 


in which K and Q are causal operators, with K linear and Q nonlinear, 
g € &y, and f is a solution belonging to &y. Typically, f represents 
the system response and g takes into account both the independent 
energy sources and the initial conditions at ¢ = 0. 

It is often important to determine conditions under which a physical 
system governed by one of the above equations is stable in the sense 
that the response to an arbitrary set of initial conditions approaches 
zero (i.e., the zero vector) ast — ©. Ina great many cases of this type, 
g belongs to L2y(0,) and approaches zero as ¢ — © for all initial 
conditions, and, in addition, it is possible to show that if f ¢ Ley(0,~), 
then f(t) ~ Oast— o~. 
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In this paper we attack the stability problem by deriving conditions 
under which g ¢ £ey(0,0 ) and f € &y imply that f ¢ Ly(0,0). From an 
engineering viewpoint, the assumption that f ¢ &y is almost invariably a 
trivial restriction. 

As a specific application of the abstract results of Section III, we con- 
sider, in Section IV, the following integral equation which governs the 
behavior of a general control system containing linear time-invariant 
elements and an arbitrary finite number of time-varying nonlinear 
elements: 


g(t) = f(t) + | k(t — r)Wlf(r), dz, = t 20 (3) 


in which k ¢ Kiy MN Key , ¥[-,-] is as defined in Section I, and g ¢ Lay(0, ©). 
We prove that every solution f of (3) belonging to ® N &y in fact belongs 
to L.y(0,%) and approaches zero as t —> © if, with 


K(s) =| ke" dt for «ZO, 
0 


(¢) det [lv + 3(a + 6)K(s)] 40 foro 20 
(i) 3(8 — a) sup A{flw + 2(a + B)K(iw)|"K(iw)} <1. 


An analogous result is proved for the integral equation 


g(t) = ylf(), + i k(t — f(r) dr, +20. 


For N = 1, the key condition (27) possesses a simple geometric interpre- 
tation: it is satisfied if and only if the locus of [K(iw)]' for —«o < 
w < © lies outside the circle of radius $(@ — a) centered in the complex 
plane at [~3(a + 8),0].T 

In Section V we consider two direct applications to nonlinear differ- 
ential equations. One of our results asserts that if f is any real-valued 
function of ¢ defined and twice-differentiable on [0,) such that 


af 4 a2 


for almost all ¢ ¢ [0,0 ), where g e RM &2(0,~0), ¥[-,-] is as defined in 
Section I with N = 1 and a > 0, and a is a real constant such that 
a> VB — Va, then f ¢ £:(0,©) andf(t) ~Oast— ». 


+ For some earlier results concerned with frequency-domain conditions for the 
stability of nonlinear or time-varying systems, see Refs. 1-4. 


NONLINEAR FUNCTIONAL EQUATIONS 1585 


III. KEY RESULTS 
Assumption 1: It is assumed throughout that 


(t) K is a linear causal operator with domain ©(K) such that 
Lov(0,0 ) C D(K) C By(0,0) 
(71) K maps £2y(0,) into itself, and is bounded on L2y(0, © ) 

(777) Q is a (not necessarily linear) causal operator with domain 
D(Q) C Hy(0,~ ). 

The following two theorems are the key results of the paper. 
Theorem 1: Let f eD(Q) A &y such that Of e D(K) N Ey , KOf ¢ Sy , and 
g = f + KOf, where g € Lay(0,0 ). Let f not be the zero-element of &y, 
and let yo = inf {y|y > 9, || fy || # O}. 

Suppose that {f,,0 << y < ©} C D(Q) and that there exists a real or 
complex number x such that 

(4) on £oy(0,0), (I + 2K) exists and is causal 


YY | ty | 


Then f € Lon(0,% ) and 
lil sa =)" -G+4K)¢ ll, 


in which 
r= | (I at aK) 'K I sup | (Of y = wfy | ; 
v>vo Il fu I 
Theorem 2: Let fe 2(Q) N B(K) /A &y such that Kf ¢ &y , Of € Sv, and 
g = Kf + Of 


where g € Lay(0,0). Let f not be the zero-element of &y, and let yo = 
inf {y|y > 9, |l fy || # Of. 

Suppose that {f,,0 << y < ©} C D(Q) and that there exists a real or 
complex number x such that 

(7) on &y(0,%), (21 + K)™ exists and is causal 


(it) 1 (aI oh K)” | sup l ee rfy l <i. 





Then f € Lan(0,~ ) and 
If lls Q—@ "| @l+K)"9 |; 


an which 


q = || I+ K)~ || sup (Qh = thy Il 
Y>vo ll fy | 
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3.1 Proof of Theorem 1 


It is convenient to introduce the operator P defined on 3Cy(0,% ) by 
Pf = f, , where y is an arbitrary real positive number. 
From g = f+ KQf, we clearly have 


Jy = fy + PKQS. 
Since K is causal, 

dy = fy + PKPQS. 
Similarly, since Q is causal, 

Jy = fy + PKPQP/ 

= fy + PKPQ/,. 
Thus, 
Jy = P(I + 2K)f, + PKP(Q — al)f,. 
Since on L2y(0,0), (I + 2K)” exists and is causal, 
P(I + «K)"P(I + 2K)f, = fy, 
and hence, 
fy = —P(I + «K)PKP(Q — alf, + P(L + aK) “gy. 
It follows that 
ll fu || = || PA + 2K) PK |||] PQf, — afy || + |] PO + 2K)“q, | 
Moreover, in view of the causality of (I + xK)™, 
P(I + «K)™ PK = P(I + 2K) 'K, 
and hence, using the fact that P is a projection on £L2y(0,), 
|| PCI + 2K) 'PK || S || (I + 2K) K J. 
Similarly, 
|| P(I + eK) “gy || = | (+ 2K) “g I. 
Thus, with r as defined in the statement of the theorem, 
fv ll Sr lf + I] + 2K) “9 || 

a 


lf] S$ Q—4r)* | + 2K) I. 
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Since this inequality is valid for arbitrary positive y, it follows that 
f € &av(0,) and 

fils —7r)* | + 2K)" IL 


3.2 Proof of Theorem 2 


The argument is essentially the same as the one used in the proof of 
Theorem 1. 
We have, with P as defined in the proof of Theorem 1, 


Jy = PKf + PQS 
PKPf + PQPf = PKf, + PQS, 
= P(aIl + K)f, + PQQ — 2Df,. 
Using the fact that on &oy(0,), (aI + K)™ exists and is causal 
fy = —P(al + K) "PQ — af, + Pal + K)"'g,. 
Thus, with q as defined in the statement of the theorem, 
lf ll Salli +l @I+K)%9 |, 


lA 


or 
IFS Q-—9 "| @I+K) gl. 

This inequality is valid for arbitrary positive y. Hence f ¢ £2(0, © ) and 
fll S$ GQ—-—4@)" | GI+K)"9l. 


Remark: A moment’s reflection concerning the proofs of Theorems | and 2 
will show that by simply reinterpreting the symbols, analogous results 
can be obtained for other function spaces. 


IIA 


3.3 Conditions under Which the Hypotheses of Theorems 1 or 2 Con- 
cerning x Are Satisfied 


The following theorem asserts that the hypotheses of Theorems 1 or 
2 concerning x are satisfied in certain special but very important cases. 
The implications of the theorem are of direct interest in the theory of 
passive nonlinear electrical networks. 


Theorem 8: Let f be as defined in Theorem 1 {Theorem 2]. Suppose that 
(2) there exist a nonnegative constant k, and a positive constant k. such 
that 


Re Qh fd Z aA, OH" sSkilf iP  ford0<y< 
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(17) K maps Len(0,) into itself such that there exists a nonnegative 
constant c with the property that 
Re (Kh,h) = ¢ || h|l’ 
for all h € Soy(0,0). 


Then the hypotheses concerning x of Theorem 1 [Theorem 2] are satisfied 
af either: 
ki >0O and c20, 


or 


ky =0 and c>0O. 


3.4 Proof of Theorem 3 


Lemmas 1, 2, and 3 (below) imply that for real positive x the oper- 
ators (I + 2K) and («I + K) possess causal inverses on £2y(0,%) and 


< 2/Kl|—¢) +2] K|P 


-1 2 
Wa ER Sod TED 
—1 12 1 


With x real and positive, 

| (Quy — afy IP S I Qy — afy IP 
|| Qfy ||P — 2a Re (Qty sfx) + 2” I fy IP 
S (ky — Quek, + 2°) |I Ff, |). 


lA 


IIA 


It is a simple matter to verify that if (4; +c) > 0, there exist positive 
values of x such that 


2 
eS an 2 eS eee Oe ee 


«(1 + x || K ||)? 
and there exist positive values of x such that 
ke = 22k; + x Z 1. 
x(« + 2c) 


Hence, it remains to prove Lemmas 1, 2, and 3. 


Lemma 1: Let T be a bounded linear mapping of L2n(0,%) into itself 
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such that there exists a constant c, > —1 with the property that 


Re(Tf,f) = ce: || f |!’ 


for all f ¢ &ay(0,0). Then (I + T) possesses an inverse on Lon(0,  ). 
Proof: 

Since c, > —1, it is evident that there exists a positive constant ky 
such that 


Re ((I + T)f,f) = hr || f IP? 


for all f ¢ Ley(0, ©). This, together with the boundedness of T, implies 
that (I + T)~ exists (see Ref. 5, for example). 


Lemma 2: Let T be an invertible bounded linear mapping of L2n(0,~ ) into 
itself such that T is causal and Re (Th,h) = 0 for all h € Lay(0,). Then 
T is causal. 
Proof: 

A bounded linear mapping A of £2y(0,%) into itself is causal if and 
only if® 


Re [ (an'yat 2 - Al [fra 


for all real y = O and all f ¢ L2y(0,0). Thus, to prove the lemma it 
suffices to point out that the causality of T implies that 


y 
Re | (Tg)* dt = Re (Ty, g,) 2 0, 
0 
for all real y = O and all g € L2y(0,), and hence that 
y 
Re [ h*Th dt = 0 
0 


for all real y = O and all h € Loy(0, 0). 


Lemma 8:} Let T bea bounded linear mapping of L2n(0, © ) into itself such 
that (I + T) ts invertible and there exists a real constant c2 with the property 
that 


Re (Tf,f) 2 c If I 


for all f ¢ Lay(0, 0). Then, for ce 2 —§, 


t Lemmas 1 and 8, and their proofs, remain valid if £2y(0,<) is replaced with 
an arbitrary Hilbert space with inner product (-,-) and norm ||-]]. 
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J+ TT] S01 - Get HD+ IT? 
and, for ¢, > —%, 
| (L+T)7 |] $ (1 + 20). 
Proof: 


In order to establish the first inequality, let g = (I + T)'Tf and, 
using the fact that g = f — (I + T)’’f, observe that 


(9,9) = (ff) — 2 Re (Tz,z) — (2,2), 
where z = (I+ T)7/f. Since 
2 Re (Tz,2) + (2,2) = (2e2 + 1) || 2 |) 
and 
Ieh20GtnN Ur ile @+FUTI Ws, 
it follows that 
(9,9) S$ [L — (Qe + 1)(1 + TIKES 
for all f, g € Lan(0, ) such that g = (I++ T) Tf. Thus 
| + TTI st — Qe +1) + TI) 7. 


The second inequality follows directly from the fact that if g = 
aD), 


If? = |g |? + 2 Re (Tg,9) + || Tg |? = (1 + 2e2) [I g ||”. 


IV. APPLICATIONS TO NONLINEAR INTEGRAL EQUATIONS 


In this section our primary objective is to prove the following two 
theorems. 


Theorem 4: Let k € Kiy and let 
t 
g(t) = st) + f RE - WYr), rdr, 620 
0 
where g € en(0,0) and fe RAN Ey. Let 


K(s)'= [ k(e* dt, « =O. 


0 


Suppose that 
(7) det [lv + 3(a@ + 8)K(s)] 40 foro 20 
(i) 2(B — a) sup A{[lw + 3(@ + 8)K(iw)J"K(iw)} < 1, 
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Then f € Lon(0,0). 


Theorem 5: Let k ¢ K,y and let 
gt) = [ Me- Ae) dr + yd, 620 
where g € Lay(0,0) and fe RAN Ey. Let 
k(s) = [ ke", 6 BO. 


Suppose that 
(z) det [3(a + B)ly + K(s)] 40 foro 2 0 
(ii) 3(8 — @) sup Af[R(a + Bly + Kw) "} <1. 


Then f € £2y(0,@ ). 


4.1 Proof of Theorems 4 and 5 
In Theorems 1 and 2 let Q denote the operator defined by 
(Qg)(t) = vig), OSt<% 


where g is an arbitrary element of ® /M 3Cy(0,). This operator maps 
R | Lov(0,~) into itself and possesses the property that for any real x 


|| Qh — zh || S n(x) | h Il, he @N Loy(0,~) 
where 
n(x) = max [(a — a), (B — 2)). 
Thus, with K defined on £2y(0, 2) by’ 
Kh = a k(t — r)h(r) dr, he Lav(0, &), 
condition (72) of Theorem 1 and the corresponding condition of Theorem 
2, respectively, are satisfied if there exists a real x such that 
| + 2K)"K || n(x) <1, 
and 
|| (@I + K)™ | a(x) < 1. 


Lemmas 4 and 5 (below) imply at once that if the assumptions of 
Theorem 4 (Theorem 5) are met, then hypotheses (2) and (zz) of Theorem 
1 (Theorem 2) are satisfied with x = }(a@ + 8). It can be shown! (with the 
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aid of Lemma 4) that this choice of x is optimal in the sense that if there 
exists a real x such that (I + xK) possesses a causal inverse on Loy(0, ©) 
and 


| (I+ aK)" K]] (x) < 1, 
then [I + 3(a@ + 8)K] possesses a causal inverse on L2y(0, ©) and 
| (I+ «K)"K || n(z) 2 || I+ 3(@ + 8)KI'K || nf$(e + 8)). 


This choice of x is similarly optimal with regard to the statement of the 
conditions in Theorem 5. 

Before proceeding to the statement and proofs of the lemmas, it is 
convenient to introduce a few definitions. 


4.2 Definitions 


With 7 an arbitrary positive constant, let S, denote the mapping of 
Lev(0, ©) into itself defined by 


S.A) = 0, i € (0,7) 
=f(t— 1), telr,@) 


for any f ¢ Lev (0, ~). 
Let 


faye, 0) = {slfetey(—e, =), [fat < oh, 
We take as the definition of the Fourier transform of f ¢ Lay(— ©,«): 
f = lim. [ : fie” al; 
and consequently, 
Qa [ria = [Fifa 
By the Fourier transform of an f € £2y(0,%) we mean simply 


Lim. i fe dt. 
0 


4.3 Lemmas 4 and 5 


Lemma 4: Let A be an invertible linear mapping of L2n(0,) into itself 
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such that for an arbitrary f ¢ Snv(0, ) 
AS,f = S-Af, tT> 0. 


Then A ts causal. 
Proof: 

Suppose that, on the contrary, A possesses an inverse on L2y(0,& ), 
but that it is not causal. Then there exist elements 2,, 22 € Lav(0,0), 
and aé > O such that z(t) ~ O on some positive-measure subset of 
(0,6), and Az; = Ssz.. Since A is assumed to possess an inverse, there 
exists a unique 23 € Lev(0,) such that z2 = Az; . Thus, 


AS323 => S;Az3 => S322 : 


Clearly, S3z3 > 2, , which contradicts the assumption that A possesses an 
inverse. This proves the lemma. 


Lemma 6: Let ue Ry and let U be the mapping of L2n(0,~) into itself 
defined by 
Us = i ult — r)f(r) dr, fr € Sav(0, &). 
Let 
U(s) = ia u(tye * dt, c= 0. 


Suppose that det [lv + U(s)] ¥ 0 fer « = 0. Then 


(2) (I + U) possesses an inverse on L2n(0,@ ) 
(7%) || I+U)"U| s sup A{[lw + U(iw) J" U(iw)} 
| C+ UY] S sup A{ly + Uw). 
Proof: 


Consider first the invertibility of the operator (I + UT) defined on 
L2y(— ©,%) by 


G+ Das [ we- flr) d, fe &x(—2, ©). 


The assumption that wu ¢€ Ky implies that the elements of U(iw) 
approach zero as |w| — «, and that they are uniformly bounded 
and uniformly continuous for w ¢ (—«,«). Thus, det [ly + U(tw)] 
approaches unity as | w|— ©, and is uniformly continuous for 
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we(— 2,0). It follows that det [ly + U(iw)] ~ 0 forall w implies that 
inf | det [ly + U(iw)]| > 0, 


and hence that 
sup A{[ly + U(iw)]"} < o. 


Let g denote the Fourier transform of an arbitrary g ¢ Ly(— 6,0). 
Then, 


[ Gly + Uliw)! [ly + UCiw) 19 do 


oe} 


: A Ly + Ua) GG de 


IA 


A 


Aes GAT [ PG diy es, 


and hence, by the Riesz-Fischer theorem, there exists an f ¢€ Ley(— ~,« ) 
with Fourier transform 
f = [ly + Uliw) 1G. 
This establishes the existence of (I + J)”. 
Since det [ly + U(s)] # 0 foro = 0, and U(s) ~0as|s|— © uni- 


formly in the closed right-half plane, every element of [ly + U(s)]”’ is 
analytic and uniformly bounded for « > 0. Thus, (I + U)-! maps 


(f | fe Lev(— 0,0), f(t) = 0 for t < 0} 


into itself," and hence the operator (I + U) defined on £L2y(0,% ) pos- 
sesses an inverse. 

To establish the first of the inequalities stated in the lemma, let 
f € Ley(0,% ) and let 


9,10 


g = (I+ U) "VU. 
Then, with g and f, respectively, the Fourier transforms of g and f, 
g = [ly + U(iw)J"U(io)f. 
Thus, 


[. 6°9 dw = [. f° U(iw)*[1y + U(tiw)} Uw + U(iw)|U (iw) f dw 
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A 


s [aly + Uw) Uo) iff do 


A 


S sup A[ly + UGw)"UGe)} [Pf de, 


from which, using Plancherel’s identity, 


lg || S sup {fly + U(iw)! 'U(iw)} | FI). 


Thus, 
| I+ Uj "Ul s sup A{[lw + U(iw)!"U(iw)}. 


By simply repeating this argument with (I + U)“U and [ly + U(iw)]7- 
U(iw), respectively, replaced with (I + U)™ and [ly + U(iw)]™, we 
find that 


| I+ U)™ || S sup Affty + U(io)T"}. 
This proves the lemma. 


4.4 Remarks 


It can easily be shown that conditions (7) and (77) of Theorems 4 and 
5 are satisfied if a > 0 and 


KGa) + KGa)" 


is nonnegative definite for all w. 

A moment’s reflection concerning the proof of Theorems 4 and 5 will 
show that those theorems remain valid if ¥|f(¢),t] and f, respectively, are 
replaced with (Qf) (é) and f ¢ O(Q), with the understanding that 

(a) Q is a causal mapping of a subset D(Q) of &y into &y such that 
Qh € Ley(0,) whenever h ¢ D(Q) M Lon(0,%), and there exist real 
constants a and 8 (8 > a) with the property that 


| Qh — 3(a + B)h || S (6B — @) hl 


for all h e D(Q) N Lon(0,~). 
(b) ifhke D(Q), {ty,O<y < ~} C O(Q). 


4.5 Conditions under Which f(t) ~ 0D ast— « 


Theorem 6: Suppose that the hypotheses of Theorem 4 are satisfied, that 
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g(t) ~Oast > ~, and that k € Key. Then f(t) ~O0ast— o. 
Proof: 
Observe first that the N-vector-valued function with values 


vif), OFt< &, 
is an element of ® M Loy(0,). Thus it suffices to show that if h ¢ 
Lov(0, 0), 
t 
[ k(t — r)h(r) dr 30 as to. 
0 


In terms of K(iw) and h(iw), respectively, the Fourier transforms of 
k and h, 


[ k(t — r)h(r) dr = = [ K(iw)h(iw)e™ dw. 


Since k € Key , it follows that the modulus of each element of the N- 
vector K(tw)h(iw) is integrable on the w-set (— »,«). Thus, by the Rie- 
mann-Lebesgue lemma 


t 
[ ke - Ahr) dr 0 de oy Sees 
0 


This proves Theorem 6. 
It is obvious that essentially the same argument suffices to prove the 
following corresponding result relating to Theorem 5. 


Theorem 7: Suppose that the hypotheses of Theorem 5 are satisfied, that 
g(t) ~O0ast— ~, and that k € Key. Then p[f(t),t] ~ 0 ast o. 
Vv. APPLICATIONS TO NONLINEAR DIFFERENTIAL EQUATIONS 


Theorem 8: Let A be an N X N matrix of real constants, let y[-,-] be as 
defined in Section I with a and B, respectively, replaced with & and B, and 
let f denote a real N-vector-valued function of t defined and differentiable 
on [0,0 ) such that 


a+ Af+ wi = 9 


for almost all t ¢ [0,0 ), where g e AN Lew(0, 0). 
Suppose that 
(i) det [sly + 3(&4+ B)ly + A] # O foro 20 


(vi) $(B — &) sup Ai[(iw)Iw + 2(@ + B)ly + AT} <1. 
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Then f € &2n(0,%) and f(t) ~ODast— ~. 
Proof: 

Clearly, fe @ NM Ey. Using the well-known expression for the solution 
of an inhomogeneous system of linear first-order differential equations 
in terms of the solution of the corresponding matrix homgeneous dif- 
ferential equation, and regarding 


g+3(&+ Bf — YA 


as the “forcing function,” we find that f satisfies 


t 
ome+ [ ea (r) dr 
0 


=fO+ [ot {uso db la+ ayr(e)| de 


for t ¢ (0,0), in which B = 3(& + 8)1y + A, and c is a real constant 
N-vector. 

In view of (7), the matrix e” is an element of Kiy M Key. By the 
argument used in the proof of Theorem 6, 


t 
[ e°'a(7) dr 370 as tom, 
0 
a property which is obviously shared by e ”‘c. Thus, using the fact that 


1 >. é Yn(w, t) = 3(a ae B)w l/s _ 


A 


(n a 1,2; ia gi) 
for all t ¢ [0,) and all real w ¥ 0, the theorem follows from a direct 


application of Theorems 4 and 6 [—3(8 — &) and 3(8 — 4), respec- 
tively, play the roles of a and 6 in Theorem 4]. 


5.1 Generalization of an Earlier Theorem Concerning a Linear Differential 
Equation with Periodic Coefficients” 


Theorem 9: Let p[-,-] be as defined in Section I with N = 1 and a and £, 
respectively, replaced with & and 8. Let f denote a real-valued function of 
t defined and twice-differentiable on [0,~ ) such that 

af df = 

qe} oR Wf, = 9 
for almost all t € (0,0), wherege R M £2(0,%) and a is a real constant. 
Then if a > O0anda> VB — V4, fe &:(0,0) and f(t) ~Oast— ~., 
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Proof: 
Proceeding as in the proof of Theorem 8, we find that f satisfies 


h(t) + [ k(t — r)g(r) dr 


La + A)f(r)} de 


=f0+ [ ke- NWO, 1-5 


for t € [0,~), in which h is a solution of 


ah (| a ae 

7p t tq + 5 lat Bh = 0, 
and ke Ru N Kai with 
-1 


K(s) = [ k(t)e* dt = E + as + ; (a+ 8) | 


oz 0. 


With a = —3(8 — &) and B = 3(8 — &), condition (2) of Theorem 
4 is obviously satisfied, while condition (77) reduces to 


4(B — &) < inf |4(4+ 8) — w 4+ tao|. 


It is a simple matter to show that this inequality is satisfied ifa@ > 0 
and a > V3 t/a Hence f ¢ £:(0,). 

Since h(t) > 0 ast— © and, by the argument used to prove Theorem 
6, 


t 
[ be ng(r) dr 0 as t—> 0, 
0 


Theorem 6 implies that f(t) — 0 as t— «. This completes the proof of 
Theorem 9. 


VI. FINAL REMARK 


Some of the results and techniques of this paper are useful in establish- 
ing sufficient conditions for the existence and uniqueness of solutions of 
functional equations of the type that we have considered. The reader 
familiar with the contraction-mapping fixed-point theorem has probably 
recognized this fact. 
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ERRATA 


On the Theory of Linear Multi-Loop Feedback Systems, I. W. Sand- 
berg, B.S.T.J., 42, March, 1963, pp. 355-882. 

On page 361, the expression (yi +42), which appears twice, should 
be replaced in both positions with (y: + 2), in which denotes the value 
of ye when yi = 0. 

On page 377, the left side of the first equation of Section 9.4 should 
be det Fs, , not det F 1. 


A Frequency-Domain Condition for 
the Stability of Feedback Systems 
Containing a Single Time- 
Varying Nonlinear 
Element 


By I. W. SANDBERG 
(Manuscript received May 6, 1964) 


It ts proved that a condition similar to the Nyquist criterion guarantees 
the stabelity (in an important sense) of a large class of feedback systems 
containing a single time-varying nonlinear element. In the case of principal 
interest, the condition ts satisfied if the locus of a certain complex-valued 
function (a) ts bounded away from a particular disk located in the complex 
plane, and (b) does not encircle the disk. 


I. INTRODUCTION 


The now well-known techniques introduced by Lyapunov have led 
to many very interesting results concerning the stability of time-varying 
nonlinear feedback systems governed by systems of differential equa- 
tions. However, these methods have by no means led to a definitive 
theory of stability for even the simplest nontrivial time-varying non- 
linear feedback systems. The general problem is, of course, one of con- 
siderable difficulty. 

The unparalleled utility of the Nyquist stability criterion for single- 
loop, linear, time-invariant feedback systems is directly attributable 
to the fact that it is an explicit frequency-domain condition. The Nyquist 
locus not only indicates the stability or instability of a system, it pre- 
sents the information in such a way as to aid the designer in arriving at 
a suitable design. The criterion is useful even in cases in which the system 
is so complicated that a sufficiently accurate analysis is not feasible, 
since experimental measurements can be used to construct the loop-gain 
locus. 

The primary purpose of this article is to point out that some recently 
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obtained mathematical results,! not involving the theory of Lyapunov, 
imply that a condition similar to, and possessing the advantages of, the 
Nyquist criterion guarantees the stability (im an important sense) of 
feedback systems containing a single time-varying nonlinear element.*> f 


II. THE PHYSICAL SYSTEM AND DEFINITION OF L2-STABILITY 


Consider the feedback system of Fig. 1. We shall restrict our discus- 
sion throughout to cases in which g,, f, u, and v denote real-valued 
measurable functions of ¢ defined for ¢ = 0. 

The block labeled Y is assumed to represent a memoryless time- 
varying (not necessarily linear) element that introduces the con- 
straint u(t) = y[f(t),¢], in which y(2,t) is a function of x and ¢ with the 





Fig. 1 — Nonlinear feedback system. 


properties that ¥(0,f) = 0 for ¢ = 0 and there exist a positive constant 
8 and a real constant a such that 


eV 25 a6 
x 
for all real « ~ 0. In particular, we permit the extreme cases in which 
y(x,t) is either independent of ¢ or linear in z [i.e., Y(z,t) = W(1,t)a]. 
The block labeled K represents the linear time-invariant portion of 
the forward path. It is assumed to introduce the constraint 


v(t) = i. k(t — r)u(r)dr — ge(t), t20 


in which & and g: are real-valued functions such that 


[ew Cees TZO Pgh Zoe, (1) 


* The results of Ref. 1 relate to feedback systems containing an arbitrary finite 
number of time-varying nonlinear elements, but, with the exception of the case 
discussed here, they do not admit of a simple geometric interpretation. 

| For results concerned with frequency-domain conditions for the global asymp- 
totic stability (a sense of stability that is different from the one considered here) 
of nonlinear systems, see, for example, Refs. 2-4. 
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The function g, takes into account the initial conditions at t = 0. Our 
assumptions regarding K are satisfied, for example, if, as is often the 
case, u and v are related by a differential equation of the form 


N dv N-1 d’u 
a Ded pe > 
2 On Te 2. na '20 


in which the a, and the b, are constants with ay # 0, and 


N 

> as” #0 for Refs] = 0. 

n=0 
However, we do not require that uw and v be related by a differential 
equation (or by a system of differential equations). 
Assumption: We shall assume throughout that the response v is well 
defined and satisfies the inequality 


[ \e@ Par < « (2) 


for all finite t > 0, for each initial-condition function go that meets the 
conditions stated above and each input g; such that 


/ | gi (t) |? dt neo 
0 


Although this assumption plays an important role in the proof of the 
theorem to be presented, from an engineering viewpoint it is a trivial 
restriction (see Ref. 5). 
Definition: We shall say that the feedback system of Fig. 1 is ‘“‘L.-stable”’ 
if and only if there exists a positive constant p with the property that 
the response v satisfies 


(flo Fat) s o( [lato + oto Par) + (flat Fax) 


for every initial-condition function g. that meets the conditions stated 
above, and every input g; such that 


i | gi (t) |’ at Se. 
0 


In particular, if the system is £.-stable, then the response is square- 
integrable whenever the input is square-integrable. 

It can be shown” that the response v(t) approaches zero as t > © 
for any square-integrable input g: , provided that the system is £.-stable, 


* See the proof of Theorem 6 of Ref. 1. 
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g(t) ~Oast— ~, and 
[ ko Pa< @. (3) 


In addition, it follows at once from the Schwarz inequality that the 
response v(t) is uniformly bounded on [0,<) for any square-integrable 
input gi, provided that the system is £,-stable, g2(t) is uniformly 
bounded on [0,2 ), and (3) is satisfied. 


Ill. SUFFICIENT CONDITIONS FOR & 9-STABILITY 


Theorem: Let 
K(tw) = k(t)e **' dt, —eodwc o, 
0 


The feedback system of Fig. 1 is &-stable if one of the following three condi- 
tions ts satisfied: 

(7) a > 0; and the locus of K(iw) for —~ <w < & (a) les out- 
side the circle Cy of radius 4(«' — B”*) centered on the real axis of the 
complex plane at [—3(a* + B*),0], and (b) does not encircle C, (see 
Fig. 2) 

(ii) a = 0, and Re[K(iw)] > —B™ for all real w 

(417) a < 0, and the locus of K(iw) for —~ <w < & ts contained 
within the circle Cz of radius 3(B8-' — a) centered on the real axis of the 
complex plane at [—3(a + B’),0] (see Fig. 3). 

Proof: Note first that 


[ | w(t) i dt Ss max (8", la \*) i if) P dt, 





| 
| 
Sa: 
B 


| 
4 


Fig. 2— Location of the ‘‘critical circle’? Ci in the complex plane (a > 0). 
The feedback system is £2-stable if the locus of K(tw) for —-~© <w< © lies outside 
C, and does not encircle C; . 
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Fig. 3 — Location of the ‘‘critical circle’? Cz in the complex plane (a < 0). 
The feedback system is £2-stable if the locus of K(iw) for -© <w < © is con- 
tained within C2. 





and hence, by a well-known result, 


[| wc vucar [aes (Feo iar) [leo Pa 


< max(attaF)([ 1a lar) [iso Pat 


Using Minkowski’s inequality, 
2 \i 
*) 


(fiw Pa) s (f° 


+ (fia Fat)! = maxie,lal) [|e 1a 


(f 190 P at) + (J | g(t) av). 


Consider now the relation between (gi + gz) and f: 








IA 


i Miu ae 








nt) + ol) =f) + f ke WUC rldr, 620 
and suppose that 
[ | a) + go(t) P dt < @. 


According to the results of Ref. 1, our assumptions” imply that there 
* In Ref. 1 it is assumed that 


[190 [>dr < 
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exists a positive constant p; (which does not depend upon q or gz) 
such that 


[is Parca [ln +o Pat 
0 0 
provided that, with 
KGy= [ k(t)e** dt 
0 


and w = Im{sl, 
(7) 1+4(a+ 8)K(s) # 0 for Re[s] 2 0, and 
(it) 3(B — a) Pe | K(iw)[1 + 3(e + B)K(iw)J™ | <1. 


Thus the feedback system of Fig. 1 is £.-stable if conditions (7) and 
(77) are satisfied. 

According to the well-known theorem of complex-function theory 
that leads to the Nyquist criterion, condition (7) is satisfied if (and 
only if) the polar plot of K(iw) for —% < w < does not encircle 
or pass through the point [—2(a + 8)~’,0]. It can easily be verified 
that condition (7) is met if one of the following three conditions is 
satisfied. 

(a) a > 0, and the locus of K(tw) for —~” < w < © lies outside 
the circle C, of radius 3(a@’ — 6’) centered in the complex plane at 
[—3(a@ + 8”),0). 

(b) a = 0, and Re[K(iw)] > —6™ for all real w. 

(ec) a < 0, and the locus of K(iw) for —2 < w < © is contained 
within the circle C. of radius 4(8"' — a) centered in the complex 
plane at [-3(a' + 6 °),0). 

If a > 0, the point [—2(a@ + 8) *,0] lies on the real-axis diameter of 
C,, while if condition (b) or (c) is met, it is impossible for the polar 
plot of K(iw) to encircle the point [—2(a + 8)',0]. Therefore, the 
conditions of the theorem guarantee that the feedback system is L.- 
stable. 


Remarks 


With regard to the necessity of our sufficient conditions for £,-stability, 
consider, for example, the case in which a > O and suppose, for sim- 
plicity, that v and wu are related by a differential equation of the type 
mentioned in Section II. Then, a moment’s reflection shows that there 
exists a y(a,t), in fact a w(2,t) which is independent of ¢ and linear in z, 





for all finite ¢ > 0. Our assumption that (2) is satisfied for all finite £ > 0 implies 
that this condition is met. 
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that satisfies our assumptions and for which the feedback system is 
not &2-stable, provided that for some value of w, K(tw#) is a point on 
the real-axis diameter of C,. This clearly shows that the condition is 
in the correct ‘‘ball park.” Similar remarks can be made concerning our 
conditions for the cases in which a < 0 anda = 0. 


IV. FURTHER PROPERTIES OF THE FEEDBACK SYSTEM OF FIG. 1 


It is possible to say much more about the properties of the feedback 
system on the basis of frequency-domain information if our assump- 
tions regarding ¥(2,t) are strengthened. 

For example, suppose that 





ne W (21,0) = (t,t) < B, v(0,E) e251) (4) 


UX — 


for ¢ = 0 and all real 2} # 2, and that one of the three conditions of 
our theorem is met. Let g, and g, denote two arbitrary input functions 
such that 


t t 
I lgi(r) dr < « and i; lgi(r) Pdr < 
for all finite t > 0, and 
I lg(r) — girlz) |? dr < ow, 


Let v and 6, respectively, denote the (assumed well defined) responses 
due to g: and g; . Then if 


t t 
[io dp iece Gnd [ \o@ Pdr < 
0 0 


for all finite t > 0, and the assumptions of Section II are met, it follows” 
that 


[ io) — d(r) ? dr < 0 


and that there exists a positive constant \ (which does not depend upon 
gi or gi) such that 


[ie 76) hae = fin) ~ alr) P ar. 


* Consider Theorem 1 of Ref. 6 with hi(é) = fi(t) = O for t < 0. 
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Suppose now that y¥(2,t) satisfies (4) and is either independent of 
t or periodic in ¢ with period 7 for each x, and that one of the three 
conditions of our theorem is met. Assume that the initial-condition 
function g.(t) approaches zero ast > , and that the input g,(¢) applied 
at t = 0 is a bounded periodic function with period 7. Then it can be 
shown” that there exists a bounded periodic function p, with period 7, 
which is independent of g. and such that the (assumed well defined) 
response v(t) approaches p(t) as t — «, provided that the conditions 
of Section II are met, (2) is satisfied for all finite t > 0, and 


Ield 
Observe that the conditions of (5) are satisfied if u and v are related by 
a differential equation of the form described in Section IT. 





k(r) | dr "at < 0, ie (1 + k(t) P dt < o, (5) 
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The Resistance of an Infinite Slab 
with a Disk Electrode 


By G. F. FOXHALL and J. A. LEWIS 
(Manuscript received April 17, 1964) 


We consider the resistance of an infinite slab measured between an elec- 
trode covering one face and a circular electrode attached to the other face by a 
uniform resistive film representing a contact resistance. Upper and lower 
bounds are found on the difference between the total resistance and the re- 
sistance of the film alone. The bounds are obtained by a combination of 
analysis and experiment, using an electrolytic tank. The results may be ap- 
plied to determine contact resistance from a measured value of total resistance 
and a knowledge of the bulk resistivity of the slab material. 


I, INTRODUCTION 


We consider the resistance of an infinite conducting slab as measured 
between an electrode entirely covering one face and a circular electrode 
affixed to the other face by a resistive film. This resistance can be 
imagined to be made up of two resistances in series: namely, the film con- 
tact resistance and a resistance which is due to the body to which the 
electrode is attached, but which depends on the film resistance. 

Lewis! has derived general upper and lower bounds on this body resist- 
ance. In the present case the upper bound may be calculated analyti- 
cally. The lower bound is the resistance which would exist between the 
electrodes in the absence of the film. Calculation of the last-mentioned 
resistance involves a classical potential problem treated by Weber in 
1873, but still not completely solved today. 

We treat this problem by a combination of analysis and experiment, 
the latter in effect being an analog computation using an electrolytic 
tank. An asymptotic solution is found which converges rapidly for slab 
thicknesses as small as one disk radius, while for smaller thicknesses ex- 
perimentally determined values of resistance are used. 

The upper and lower bounds for the body resistance, which differ only 
by 8 per cent for a thick slab and tend to the same value for a thin slab, 
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provide a convenient estimate of slab resistance in the presence of a 
contact film. More important from the practical point of view, they pro- 
vide a useful estimate of contact resistance for measured total resistance. 


Il. THE PROBLEM 


We wish to determine the electrical resistance of the slab electrode con- 
figuration shown in Fig. 1. The entire base of the slab is in perfect con- 
tact with a highly conducting, grounded electrode, while the upper 
electrode, a circular disk of radius a, is separated from the slab by a thin 





Fig. 1 — Infinite slab with disk electrode. 


film of surface conductance c. The potential V(#,Z) in the slab then satis- 
fies Laplace’s equation 


0V/dR’ + dV/ROR + &V/dZ’ = 0, (1) 
for all R and 0 < Z < H, and the boundary conditions 
V(R,0) = 0, (2) 
_ JelVo — V(R,A)I, forR <a ; 
o0V(R,H)/dZ = 0, ee (3) 
where o is the conductivity of the slab. 
The total input current J is given by 
ign i oY (RH)2aR aR, (4) 
0 OZ 


so that the resistance measured between the electrodes is 


Yeo / fo 
Rn = a! = Vo [ oo (RyH)2eR dR. (5) 
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If we set 
r = B/a; z= Z/a, h = H/a, w(rjz) = V(R,Z)/Vo, 


this becomes 


oth, = i/ [ oe (r,h)2ar dr, (G6) 
“0 02 
where 
Vw = dw/dr + dw/rdr + d’w/dze" = 0, (7) 
ndO<z<h, 
w(r,0) = 0, (8) 
for all r, and 
dw(r,h)/dz = eae 0, were, a . (9) 


It is easy to show that R,, may be written in the form 
Rn = R, + R(w), (10) 


where F, is the film resistance (1/7a’c, in the present case) and R(w) 
is the ratio of average potential difference between electrodes to total 
current, 1.€., 


1 1 
caR(w) = 2 i w(rh)r ar | f om (rh) 2rr dr. (11) 
0 0 02 


Thus, if R(w) has been calculated and 2,, measured, the film resistance 
R. may be determined. 

However, the calculation of R(w) in general involves the solution of a 
difficult mixed boundary value problem. Furthermore, since w depends 
on the film conductance c, which is essentially the quantity to be deter- 
mined by combined calculation and measurement, R(w) must be calcu- 
lated for a large number of values of c to make certain that the experi- 
mental range is covered. These difficulties can be circumvented, with 
only a moderate loss in accuracy in the present case, by the use of cer- 
tain upper and lower bounds on R(w). 


III. UPPER AND LOWER BOUNDS 
The bounds on R(w) have the form! 
R(u) S$ R(w) S Ri), (12) 
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where 
* ou 
caR(u) = i/f — (r,h)2ar dr, (13) 
0 O02 
1 
cak(v) = (2/n) | o(r,h)r dr, (14) 
0 
u and v satisfy (7) and (8), and 
u(r,h) = dv(r,h)/dz = 1, forr <1 (15) 
du(r,h)/dz = dav(r,h)/dz = 0, forr > 1. (16) 


The lower bound R(u) is the resistance as usually defined, i.e., the re- 
ciprocal of the total current for a unit potential difference applied uni- 
formly between the electrodes. On the other hand, R(v) is the average 
potential difference required to give unit total current, distributed uni- 
formly over the input electrode. The former case may be realized by let- 
ting the film conductance c become very large; the latter by letting c 
tend to zero and V, tend to infinity in such a way that cV» tends to a 
finite value. 

The calculation of R(v) is straightforward, involving only the solution 
of an unmixed boundary value problem (dv/0z specified all over z = h), 
but 2(w) involves the solution of a mixed boundary value problem and 
ultimately the solution of dual integral equations. An asymptotic 
solution of these integral equations, valid for large thickness (h > 1), 
has been obtained by Tranter.” The corresponding expression for (1) 
is rapidly convergent, so that it appears to be usable down to values of 
h of order unity. The range h < 1 is covered by measurements on an 
electrolytic tank analog. 


IV. THE UPPER BOUND 


If we set 





v(ra) = | (p) SEP Jolpr ap, (17) 


the conditions 
Vv = v(r,0) = 0 
are satisfied and the remaining conditions become 
ie fom, A 


(18) 
0, r>tl. 


dv(rh) /dz = [ pf(p)Jo(pr)dp = 
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Now 
es 1, pose od 
[ solor)(p)ap = (19) 
0 0, p> 1, 
so that, if we set 
f(p) = Ji(p)/p, 
we obtain the complete solution. In particular, 
* tanh ph 
v(rp) = [AP np) dolor dap (20) 
and 
caR(v) = (2/x) | Ws(p)/pl* tanh ph dp. (21) 


For small h, caR(v) & h/r or R(v) & H/ra’c, the resistance of a cir- 
cular cylinder, while for large h, caR(v) ~ 8/37’, a result derived in 
Carslaw and Jaeger.° 

Miss M. C. Gray has obtained an expansion in powers of h~’ whose 
first two terms, i.e., 


8 _ log2 


oake(y) Fe 37° = 2rh 


(22) 





give reasonable accuracy down to h = 1. She has also evaluated the in- 
tegral for R(v) numerically for 0.1 < h < 10. This is the curve labeled 
Rv) in Fig, 2. 


V. THE ASYMPTOTIC VALUE OF R(u) 


If we assume u(r,z) to have the same form as v(r,z) in the previous sec- 
tion, the function f(p) must now satisfy the dual integral equations 


wh) = [ 4) tanh phJo(pr)dp = 1, pe °{23) 


dur) /dz = is pf(p)Jo(pr)dp = 0, r>1. (24) 


We can no longer solve these equations by inspection, but for large h an 
approximate solution, due to Tranter,? is available. Tranter gives 


itp) (2) 4G) BL, (25) 
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where 








ROY 2 2 108 2 4 ? log =) + O(n"). (26) 


1 ah 
Yor A = 1 (hk = &) we obtain the classical result 
[rah (u))o = 4, 
and for large h 
oakk(u) & 1/4A(h). (27) 


This function is shown in Fig. 2 for h = 1. 


VI. THE EXPERIMENT 


In order to verify and supplement the computed values of resistance, 
an experiment using an electrolytic solution as the conducting slab was 
devised. The apparatus, shown in Fig. 3, consisted of a 10 X 14 inch 
plastic tank into which a gold-plated brass plate was fitted. A 0.01 normal 
IXCl solution was used to simulate the conducting slab, while the end of 
a 0.564-inch diameter gold-plated brass rod served as the disk electrode. 

The experiment consisted of two parts. First resistance measurements 
were made for the slab configuration at various solution levels. Then a 
glass sleeve, closely fitted to the upper electrode, was used to constrain 
the current in the solution to a simple cylindrical geometry. This pro- 
vided a measurement of solution conductivity and also of contact resist- 
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Fig. 2 — The resistance of an infinite slab. 
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Fig. 3 — Electrolytic tank analog. 


ance. The resulting measured values are shown in Fig. 4 as a function of 
solution depth. The slope gives a conductivity of 1.33 X 10-3 (ohm-cm)-"', 
in close agreement with the tabulated value‘ for the solution temperature 
of 22°C. Extrapolation to zero solution depth indicates a negligible con- 
tact resistance. 

As seen in Tig. 2, the measured values of resistance for the slab fall 
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Fig. 4 — The resistance of a cylinder of electrolyte. 
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very close to the computed values in the range 2 S h S 4, where the 
asymptotic form for R(w) can be expected to be accurate. At smaller 
values of h, the accuracy of the asymptotic form decreases and the meas- 
ured values may be taken as a good approximation to R(u). At large 
values of h, on the other hand, the asymptotic form becomes very ac- 
curate, while the experimental model becomes less so. This may be seen 
in Fig. 2 forh & 5. 

This divergence, which at first was attributed to the finite diameter of 
the tank, is now believed to be due to polarization effects produced by 
the nonuniform field near the upper electrode. The abrupt upturn of 
resistance near h = 5 cannot be due to finite tank diameter, which would 
yield a resistanee-depth curve with a slope decreasing from a value for 
a cylinder having the same diameter as the upper electrode to one for a 
long cylinder having the same diameter as the tank. On the other hand, 
the current density distribution over the upper electrode is nonuniform 
for any depth, being infinite at the electrode edge in the mathematical 
idealization. This nonuniformity increases with depth, for more current 
is drawn from the electrode center at small depth than at large depth. 
Thus for fixed total current (the experimental condition) the current 
density at the edge increases with increasing depth until a depth is 
reached (h & 5 in the experiments) at which local polarization effects 
become appreciable. This dependence on depth also accounts for the 
different behavior of measured values for the cylinder (Tig. 4) and the 
slab (Fig. 2). 














NORMALIZED RESISTANCE, agR 











0.1 0.2 04 06 1 2 4 6 8 10 
THICKNESS RATIO, h 


Fig. 5 — Upper and lower bounds on the resistance of an infinite slab. 
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VIT. SUMMARY OF RESULTS 


Tig. 5 summarizes the results of the analysis and the experiment. 
The lower curve, labeled R(u), gives the resistance between the elec- 
trodes for zero contact resistance. It also gives a lower bound on the 
difference between the total resistance R,, with a resistive film between 
the upper electrode and the slab and the resistance R, = 1/ra’c of the 
film itself, while the upper curve, labeled R(v), gives an upper bound on 
the same quantity. Thus, for all slab thicknesses and (constant) film 
conductances c, 


R(u) S Ra — Rk. S Rv), (28) 
or, assuming F,, to be determined by measurement, 
Rn — Riv) SR. S Run — R(t) (29) 


giving an estimate of the contact resistance itself. 
To supplement Fig. 3, the asymptotic forms 


oaR(u) ~ (0.250)/[1 + (0.441/h) + (0.441/h)"I, (30) 
oaR(v) ~ (0.270) + (0.110/h), (31) 


may be used for thickness ratio h = 1, while, for h S 0.1, both caR(u) 
and caR(v) are closely approximated by the cylinder resistance h/z. 
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Permutation Groups, Complexes, and 
Rearrangeable Connecting Networks 


By V. E. BENES 
(Manuscript received March 12, 1964) 


In the interest of providing good telephone service with efficient connecting 
networks, it ts desirable to have at hand a knowledge of some of the com- 
binatorial properties of such networks. One of these properties ts rearrange- 
ability: a connecting network is rearrangeable if its permitted states realize 
every assignment of inlets to outlets, or alternatively, if given any state x of 
the network, any inlet idle in x, and any outlet idle in x, there 1s a way of 
assigning new routes (if necessary) to the calls in progress in x so that the 
idle inlet can be connected to the idle outlet. 

A natural algebraic and combinatorial approach to the study of rear- 
rangeable networks ts described, with attention centered principally on two- 
sided networks built of stages of square crossbar switches, cach stage having 
N inlets and N outlets. The approach is based in part on the elementary 
theory of permutation groups. The principal problem posed (and partly 
answered) is this: What connecting networks built of stages are rearrange- 
able? Sufficient conditions, including all previously known results, are 
formulated and exemplified. 


I. INTRODUCTION 


A connecting network is an arrangement of switches and transmission 
links through which certain terminals can be connected together in 
many combinations. Typical examples of connecting networks can be 
found in telephone central offices, where they are used to complete calls 
among the customers themselves, and between customers and outgoing 
trunks leading to other offices. 

In the interest of providing good service with efficient connecting net- 
works, it is desirable to have a thorough understanding of some of the 
combinatorial properties of such networks. In a previous paper,! we 
singled out three such combinatory properties as useful in assessing the 
performance of connecting networks. The weakest of these properties 


1619 


1620 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1964 


was that of rearrangeability. A connecting network is rearrangeable if its 
permitted states realize every assignment of inlets to outlets, or alterna- 
tively, if given any state x of the network, any inlet idle in xz, and any 
outlet idle in x, there is a way of assigning new routes (if necessary) to 
the calls in progress in x so as to lead to a new state of the network in 
which the idle inlet can be connected to the idle outlet. 

Figs. 1 and 2 show the structure of two connccting networks built out 
of square crossbar switches, with each switch capable of connecting any 
subset of its inlets to an equinumerous subset of its outlets in any de- 
sired one-one combination. The network of Fig. 1 is often found in tele- 
phone central offices; we may call it the No. 5 crossbar network. It is 
not rearrangeable. The network of Tig. 2 7s rearrangeable, but so far it 
has not found extensive practical use. 

We shall describe a natural algebraic and combinatorial approach to 
the study of rearrangeability. For the most part we restrict attention to 
two-sided connecting networks that are built of stages of crossbar 
switches, and have the same number N of inlets as outlets. The approach 
is based in part on the elementary theory of permutation groups. The 
way the connection with group theory arises can be summarized as 
follows: a maximal state of the network is one in which no additional 


- CROSSBAR 
/ SWITCH 


CUSTOMERS 
LINES TRUNKS 





LINE LINK FRAMES TRUNK LINK FRAMES 


Fig. 1 — Structure of No. 5 crossbar network. 
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Fig. 2 — Rearrangeable network. 


calls can be completed in the network; suppose that both the inlets and 
the outlets are numbered in an arbitrary way from 1 to N; each maximal 
state realizes some submap of a permutation on {1, --- , N}; the net- 
work is rearrangeable if and only if the whole group of all permutations 
of {1, --- , N} is generated in this way by the maximal states of the 
network. Details are worked out in the main body of the paper. 

It is not possible to explore in one paper all the possible uses of group — 
theory in the study of connecting networks. Indeed, we shall restrict 
ourselves to formulating the fundamental problem of rearrangeable 
networks in terms of complexes of permutations, and to giving a partial 
answer. One of the difficulties with the approach is that it always seems 
to be easier to obtain results about groups by the few available methods 
known for rearrangeable networks, than vice versa. 

A sequel? to the present paper is concerned with the problem of syn- 
thesizing a rearrangeable network (for N inlets and outlets), subject to 
certain structural conditions and to the condition that it have a minimum 
number of crosspoints. 


II. SUMMARY 


In Section III we define a precise general notion of a “stage” of switch- 
ing in a connecting network, and, after describing how the networks 
which will be of interest are built out of stages by joining them together 
by patterns of links, we pose two problems: first, to discover what net- 
works built in this way are rearrangeable; and second, to synthesize 
optimal rearrangeable networks of given size, optimal in the sense of 
having fewest crosspoints (among those in some class of networks having 
practical interest). (See Ref. 2.) 

Section IV is devoted to giving a formulation of the first problem 
(discovering rearrangeable networks) in terms of partitions and permuta- 
tion groups, using the notion of stage. In Section V we discuss how stages 
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generate complexes (in the group theory sense, 1.e., sets of group ele- 
ments). It is shown that a stage can generate a subgroup only if it con- 
tains a substage made of square switches, a result that indicates to some 
extent the ‘‘best possible” nature of stages made of square switches. 
A known example, discussed in Section VI, indicates how a particular 
symmetric group S is generated by a rearrangeable network in the form 


s~ {fl 


ql 


with ¢: , ¢3 in a subgroup H and ¢ in a certain subgroup ¢ He conjugate 
to A. 

The remainder of the paper is devoted to proving two “rearrangeabil- 
ity” theorems for connecting networks built of stages of square switches. 
The first theorem gives sufficient conditions under which a set of stages 
of square switches connected by link patterns will give rise to a rear- 
rangeable network. The second theorem indicates a simple way of 
describing link patterns and stages that satisfy the hypotheses of the 
first theorem, and so yield many specific rearrangeable networks, gener- 
alizations of those given by Paull.’ 


III. STAGES AND LINK PATTERNS 


The switches in Figs. 1 and 2 are arranged in columns which we shall 
call stages, the switches in these stages being identical. Two adjacent 
stages are connected by a pattern of links or junctors. Along with the 
switches, the link patterns are responsible for the distributive charac- 
teristics of the network. They afford an inlet ways of reaching many 
outlets. Obviously, each outlet on a switch in a given stage is some inlet 
of the next stage, if there is one. Suppose that the NV inlets are numbered 
in an arbitrary way, and that the N outlets are also numbered in an 
arbitrary way, both from 1 to N. Then it is clear that each link pattern, 
and each permitted way of closing the largest possible number of cross- 
points in a stage, viz. N, can be viewed abstractly as a permutation on 
{1, --- , N}. Both the networks in Figs. 1 and 2 have the property that 
all maximal* states have the same number A of calls in progress, and 
any such maximal state realizes a permutation which is a product of 
certain of the permutations represented by the link patterns and the 
stages. 

It will be convenient to generalize the usual notion of a “stage” of 
switching in a connecting network. By a stage (of switching) we shall 


* T.e., states in which no additional calls can be completed. 
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mean a connecting network constructed as follows: with I the set of 
inlets, and Q that of outlets, we choose an arbitrary subset $ of I X Q, 
and we place a crosspoint between all and only those inlets u ¢« J and 
outlets v « Q such that (u,v) ¢ 8. We shall also speak of § itself as the 
“‘stage.’’ Thus we make 


Definition 1: A stage is a subset of J X Q. 

This terminology is easily seen to be an extension of the usual one, 
according to which, e.g., a column of switches in Fig. 1 forms a stage, 
the network having four stages separated (or joined) by three link pat- 
terns. Actually, a link pattern may be associated with one or the other 
(but usually not both) of the stages it connects, to define a new stage; 
we do not usually do this. 


Definition 2: A stage S is made of square switches if and only if there is 


a partition II of {1, --- , N} such that 
s= U (AXA). 
AeTl 


Definition 3: A substage 8’ of $ is a subset of 8. 

Except in the trivial case in which § is actually a square N-by-N 
switch (i.e.,8 = I X Q), a stage $ will not by itself give rise to a rear- 
rangeable network. Still, it is known that several stages joined end to 
end by suitable link patterns can together give rise to such a network, 
e.g., that of Fig. 2. We can thus formulate two fundamental questions 
about connecting networks built out of stages: 

(1) What stages and link patterns can be used to construct a rearrange- 
able network? 

(2) What stages, and how many of them, should be used to construct 
a rearrangeable network that has a minimum number of crosspoints 
(switches) for a given number of terminals on a side? 

Question (1) is studied in the present work, while question (2) is 
treated in another paper. 


IV. GROUP THEORY FORMULATION 


We shall adopt some notational conventions from group theory to 
simplify our presentation. Let G be a group. It is customary to speak of 
a subset K € Gas acomplex. If x eG, then xK denotes the set of products 
xy with y e K, Kx denotes the set of products yx with y e K. Similarly, 
if K, and Ke are complexes, K,Ke2 denotes the set of products yz with 
y ¢ Ki, and ze Ke. 

A group G of permutations is called imprimitive’ if the objects acted 
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on by the permutations of G can be partitioned into mutually disjoint 
sets, called the sets of «mprimitivity, such that every ¢ ¢ G either per- 
mutes the elements of a set among themselves, or carries that set onto 
another. That is, there is a nontrivial partition II of the set X of objects 
acted on such that ¢ e G and A e II imply ¢(A) e II. We shall extend 
this terminology as follows: 


Definition 4: G is called strictly imprimitive if it is imprimitive, and 
each set A of imprimitivity is carried into itself by elements of G, i.e., 
there is a nontrivial partition II of X such that A ¢ II implies g(A) = A 
for all g € G, so that ¢ ¢ G is “nonmixing” on I. 

Consider a stage of switching that has N inlets and N outlets. It is 
evident that such a stage provides ways of connecting some of the inlets 
to some of the outlets. If the stage contains enough crosspoints it can 
be used to connect every inlet to some outlet in a one-to-one fashion, 
i.e., with no inlet connected to more than one outlet and vice versa. 
With the inlets and outlets both numbered 1, 2, --- , N, sucha setting 
of the switches corresponds to a permutation on {1, --- , N}. Indeed, 
there may be many ways of doing this, differing in what inlets are con- 
nected to what outlets, that is, corresponding to different permutations. 


Definition 5: A stage $ generates the permutation ¢ if there is a setting 
of N switches of $8 which connects each inlet to one and only one outlet 
in such a way that 7 is connected to g(z)z = 1, --- , N, that is, if 


(7,0(2)) €8. 


Definition 6: The set of permutations gencrated by a stage $ is denoted 
by P(S). 


Definition 7: A network (with N inlets and N outlets) generates a per- 
mutation ¢ if there is a setting of the switches in the network which 
connects, by mutually disjoint paths, each inlet to one and only one out- 
let in such a way that 7 is connected to g(z),7 = 1, ---, N. 

If two stages $; , S: are connected by a link pattern corresponding to a 
permutation g2 , then the permutations that they generate together are 
those of the form 


L1P2¢3 , gi €e P(S8:), 4=1or3. 


If a network consists of two stages 81 , 82 joined by a link pattern cor- 
responding to a permutation y, then it can be seen that it generates 
exactly the permutations in the set 


P(81)¢P (82). 
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A network of s stages 8;,7 = 1, --- , s, With a link pattern corresponding 
to gi, 27 = 1, ---,s — 1, between the 7th and the (¢ + 1)th stages, 
generates the complex 


P(81)¢iP (82) +++ gs1P (8s). 
We shall occasionally use the suggestive notation 
S1¢182 +++ Gs—18s 


to refer to or indicate such a network. 

It is now possible to formulate a group-theoretic approach to the 
analysis and synthesis of rearrangeable connecting networks made of 
stages of switching joined by link patterns. Consider such a network, 
generating the complex 


P(81)¢1 ok gs1P (85). 


The factors 9;P(Si41),7 = 1, --- ,s — 1, occurring herein are themselves 
again just complexes. Thus, given any product of complexes 


II Kk: 
i=1 


we seek to know whether the product is the whole symmetric group, and 
whether the factor complexes K; can be written in the form 


eP(8) 


where ¢ is a permutation and § is a stage. In this general form the 
problem is largely unsolved; however, special cases are worked out in 
the sequel. 


Vv. THE GENERATION OF COMPLEXES BY STAGES 
We start with this elementary result: 


Remark 1: Let M be a complex (i.e., a set) of permutations. Define a 
stage S by 


S = {(a,y): ¢(2) = y for some g e VY}. 


Then P(S) DM and no smaller stage has this property. 

In cases of practical importance, such as shown in Figs. 1 and 2, the 
stages are made of square switches, and it is clear that a stage $ (with NV 
inlets and N outlets) is capable of effecting certain special permutations 
on X = {1, --- , N}, and of course, all submaps thereof. (Indeed, for 
each switch there are numbers m and n with m < n such that the switch 
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is capable of performing all the (n — m + 1)! permutations of the 
numbers k in the range m S k S n among themselves.) Since no inlet 
[outlet] is on more than one switch, these permutations form a subgroup 
of the symmetric group S(X) of all permutations on {1, --- , N}. This 
subgroup has a property which might be described intuitively by saying 
that there exist sets on which the subgroup elements can mix “strongly,” 
but which they keep separate. It is apparent, indeed, that the subgroup 
generated by a stage made of square switches is strictly imprimitive, the 
sets of imprimitivity being just the elements of the partition IT of {1, 

- , N} according to what switch an inlet [outlet] is on. This situation 
might also be described by saying that a permutation ¢ from the sub- 
group is nonmiaing on II. 

Our second observation is 


Remark 2: Let H be a strictly imprimitive group of permutations on 


X = {1, --- , N}, with sets of imprimitivity forming the partition II. Let 
S be the smallest stage with P(S) D H. Then 
s= U AXA, 
Aell 


i.e., S is made of square switches. 

The main result of this section states that a stage can generate a sub- 
group only if it contains a substage made of square switches. This sug- 
gests that stages made of square switches necessarily arise in the genera- 
tion of the symmetric group by products of complexes some of which are 
subgroups. 


Theorem 1: Let 8 be a stage, and let P(S) contain a subgroup H of S(X). 
Then there is a substage ® of 8 which is made of square switches. 


Proof: Define a relation ® on {1, --: , N} by the condition that 7®j7 if 
and only if 7 = ¢(2) for some ¢ e H. Since H is a subgroup, it must con- 
tain the identity permutation, i.e., 7Rz for alla = 1, --- , N. Let i,j,k 
be numbers in {1, --- , N} such that 7 = ¢(7) and k = y(j) for some 
permutations ¢, yy e H. Then ye e H and k = yo(t), that is, 7Rk; hence 
® is transitive. Finally, if 7 = o(7) with 9 e H, we havez = ¢ '(j) with 
¢ | ¢H, since H isa group. Hence @ is an equivalence relation, and there 
is a partition II such that 


a= U (AXA). 
AeTll 
Since 7@7 obviously implies (7,7) ¢ $8, we have 
R CS. 


® is clearly a substage of § made of square switches. 
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VI. AN EXAMPLE 


As is well-known, elementary group theory contains many results that 
allow one to write a group as a product of complexes. These results often 
involve a subgroup of the group in question. We shall quote an elementary 
result of this kind, and interpret it in terms of a network that is known 
to be rearrangeable. 

Let G be a group, and let H; and H» be subgroups of G, not necessarily 
distinct. A double coset is a complex of the form 


AyeH, , geG 


It is a known result’ that two double cosets are either identical or dis- 
joint. Thus there is at least one complex M/ with the properties 


U H,woH, = G 


geéewM 


HywH, a HwH, 


6 ify #¥y, withg,pe M. 
In particular 
G = H,MH., 


and we have factored G into a product of three complexes, two of which 
are subgroups. Now suppose that G is actually S(X), the symmetric group 
of all permutations of N objects, and that m and n are positive integers 
such that mn = N. Let II be a partition of X = {1, --- , N} into m sets 
of n elements each, and let H be the largest strictly imprimitive sub- 
group of S(X) whose sets of imprimitivity form II. Also let ¢ be a self- 
inverse permutation, and II, a partition, such that A ¢ I, B e I, imply* 


|o(A) Bl = 1. 
Let K be the largest strictly imprimitive subgroup of S(X) whose sets 
of imprimitivity form II, . 
By Remark 2, Section IV, H and K can each be generated by stages 
of square switches. 


Returning to the earlier discussion leading to the factorization G = 
HMA, we |let H; = H, = H. Now it can be seen that the complex 


HeKkeH 


is generated by a network of the form shown in Fig. 2. By the Slepian- 
Duguid theorem (Benes, Ref. 1, p. 1484) this network is rearrangeable, 
so that 


HeKeH = S(X). 


* | A | denotes the number of elements of a set A. 
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Since ¢ = ¢ ’, the complex ¢K¢ is itself actually the subgroup ¢ ‘Ky 

conjugate to K. Thus for G = S(X) and H; = Hz = K, the factor M in 
S(X) = HMH 


can be chosen to be gy ‘Ke. 


VII. SOME DEFINITIONS 


The number of elements of a set A is denoted | A |. Let X, Y be 
arbitrary finite sets with | X | = | Y |, let B be a subset of Y, let I; ,Us 
be partitions of X, Y respectively, and let y be a one-to-one map of Y 
onto X. Let 6 be the null set. 


Definition 8: 9(B) = {a e X:¢ '(x) ¢ B}. 
Definition 9: ¢ hits 1, from B if and only if A eT, implies¢(B) NA = 8@. 


Definition 10: ¢ covers 1, from Tz if and only if B e II; implies ¢ hits II, 
from B. 


Definition 11: ¢(IIe) is the partition of xX induced by ¢ acting on ele- 
ments of Iz , 1.e., 


g(Iz) = {o(B): B e Th}. 

Definition 12: sg is the restriction of ¢ to B. 

Let A be a subset of X. 

Definition 13: 4T is the partition of A induced by I], , i.e., 
all, = {C f) A: C e€ 1h}. 


Definition 14: gy B-covers II, from Tp if and only if z¢ covers g¢zyII, from 
ple. 


Definition 15: Let Ip , I, be partitions of X. Then II, > Wo (read “pi- 
one refines pi-zero’’) if and only if every set in IIo is a union of sets in 
Il, , and Wy rad IIo. 


VIII. PRELIMINARY RESULTS 


Lemma 1: Let X and Y be any sets with |X| = |Y| < ©, let Th = 
{Ai, -++, An} and Il, = {B,, --- , Bn} be partitions of X and Y respec- 
tively, and suppose that fork = 1, --: , n the union of any k elements of 


II, has more elements than the union of any k — 1 elements of Tp. Then 


(4) m= n. 


PERMUTATION GROUPS 1629 


(iz) For each one-to-one map f of X onto Y there exists a set of n dis- 
tinct integers k(1), +--+ , k(n) with 1 S k(t) S m,i =1,---,n, and 
f(A) N Bew 4 0 t7=1,---,n. 
Proof: Since Il, and IT, are partitions, and | X | = | Y |, 


Dl4el = DBs 


If m were less than n, then the union of m B’s has as many elements as 
the union of n A’s, form < n; this contradicts the hypothesis. Let 
K; = {j: f(1) ¢ B; for some / € A}, i=1,---,n. 

, Ai be any & elements of I,, 1S k S n, and set 


Also let Ai) eases 
k 
P= Kis. 
jut 


All of the 


k 
py | Axa | 
j=1 


elements of 
k 

U Ais 

j=l 


are mapped by f into | 7’ | sets of Il, . Since no union of k — 1 sets of I. 


has 


k 
2 [Ais | 
j=l 


elements, it follows that | 7 | = &. Thus the union of any & of the sets 
{k;,7 = 1,---,n} has at least k members. Hence by P. Hall’s theorem® 


there is a set of n distinct representatives k(1), --- , k(n) with 
k(i) ¢ K; t=I1,---,n 
k(t) # k(j) fort ¥ 7. 
But clearly k(z) e K; if and only if 
I) € Broa 


for somel e A;. 


that is, if and only if 
f(A: ON Bin # 6. 
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Lemma 2: Let 11, , Wz be partitions of sets X, Y respectively with the proper- 
ties | X| = | Y | and 


C1, Ce e Il, U Il, implies | C; | = | Ce |. 
Then for every one-to-one map ¢ of X onto Y there is a set D € X such that 
y hits Il, from D and ¢° hits Il, from o(D). 


Proof: We observe that | TI, | = | Iz |, and that the conditions of Lemma 
1 are satisfied, with m = n. For each onto map ¢ there is a set D © X 
with | D| = | I, |, such that 


A ell,implies DNA # 8, (¢ hits Il, from ¢(D)) 

B «Tl, implies g(D) NB # 8, (v hits Iz from D). 
Lemma, 3: Let X be any set and let Tl, and Miz be partitions of X such that 
A,B ¢ 1 UT, implies | A| = | B|. Then for every permutation ¢ on X 


there is a partition I, of X such that (7) ¢ covers I from Il,, (ti) @* 
covers I; from ¢(Il,), 


| B | 
Proof: Let ¢ be a permutation on X. The hypothesis implics that the 
union of any & elements of II, has more elements than the union of any 


k-1 elements of I, , for k = 1, ---, | 1, |. Hence by Lemma 2, with 
X = Yandm = n, there isa set D; € X such that 


[ 1, | = | Ie], Belly. 


¢g hits TI, from D, 
g hits II, from ¢(D;). 


Now. we consider the sets X,; = X — D, and Y; = Y — ¢(D,) parti- 
tioned by 


x, and y,Je respectively 


and we apply Lemma 2 to x,¢, Le., to the restriction of ¢ to X — D,. 
This gives a new set D2 © X such that again 


g hits Il, from De 
gy hits II, from o(De). 


We proceed in this manner till X and Y are exhausted, and set I, = 
{Di, +++, Dz}, where n = | A | for all A ¢ 1, UT. It is clear that II, 
has the stated properties. 
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Let ¢ be a permutation on X, and let I, , Il: be partitions of X. 


Lemma 4: If ¢ covers Tl, from Ik, , and B ¢ Iz implies | B| = | Th, |, then 
A e II, implies | A | = | TI. |. 


Proof: Since ¢ covers 1; from Tz , then A ¢ II, and B e I, imply that there 
is an 2 ¢ B with ¢ ‘(a) « A. Thus¢ (x) ¢€ A for at least | I, | distinct 
values of x, and so | A | = | Ik |. Since B e Me implies | B| = | M1, ], it 
follows that | II, | divides | X | and 


P< 


Paes 
ca | Th | 





Clearly 


2, |4| = |X. 


Aelly 


Since there are | II,| sets in I, each with at least | II, | elements, 
2» [4] 21M] - [Me] = |X]. 
AeTly 


If any A ¢ II; had more than | I; | elements the sum would exceed | X |, 
which cannot be. Thus A e¢ Il; implies | A | = | Tk]. 


IX. GENERATING THE PERMUTATION GROUP 


In this section we exhibit a sufficient condition on permutations ¢, , 
+++, gs4 and stages 8, --- , 8, under which the complex 


P(81)¢1 +++ GaP (8s) 


is actually the whole symmetric group, and the corresponding network 
(obtained by linking 8; and 8,41 by g;,7 = 1, --: , s — 1) is rearrange- 
able. 

In order to focus on the mathematical character of the results, on their 
purely formal aspects divorced from physical considerations having to 
do with switches, etc., the conditions on the ¢’s and the 8’s purposely 
are phrased in a quite abstract way. Consequently, the practical implica- 
tions and applications of the result may be unclear and require discus- 
sion. This discussion is given after the theorem has been stated, and is 
followed by its proof. 


Theorem 2: Let s > 3 be an odd integer, let g1, +--+ , Gs—1 be permutations 
on X = {1,---,N}, leh, k =1,---,s,andM’,k =1,--- ,4(s — 1), 
be partitions of X, and let 
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{X} k=0 

y= Qe °° exe (IT) k= 1,--+,3(s — 1) 
\ 2 Pue4 (IT ) k= 3(s st 1), aa a aaa 1, 
{X} k=: 


Suppose that 
(i) If s = 3, then tl’ < m1", k = 0,---,4(s — 8). 
(i) Tey = WO? 
(iit) Fork = 1, --- , 4(s — 1), and every Be V"", gy * B-covers My 
from ox from o,(¥"*). 
(wv) Fork = 4(s +1), ---,s — 1, and eery B eW'™, o, B-covers 
Wigs from gy '(¥"*). 


(v) If A eM’ and Be VW UV RO™ then | oT | = | slls—rai] = 
[A|, k= 1 Poa a= 1): 
Let H., k = 1, ---, 8 be the largest strictly imprimitive subgroup of 


S(X) whose sets of imprimitivity are exactly the elements of Il, (7.e., A ¢ Tx 
implies { sv: ¢ ¢ Hy} = S(A)). Then the complex K defined by 


K = Hyg,H, fata HgsaHs 


has the property K = S(X), and any network generating this complex is 
rearrangeable. 

The theorem given above does not provide any new designs of rear- 
rangeable networks that are not already implicit in the work of M. C. 
Paull’ and D. Slepian;’ thus no new principle is involved. Rather, in 
formulating the result, we have sought insight by stating a generalized, 
purely combinatory form of these previous results. The theorem exhibits 
this generalization, first as providing a way of generating the symmetric 
group in a fixed number of multiplications of certain restricted group 
elements, and second as based on some purely abstract properties of 
some partitions and permutations. 

As in A. M. Duguid’s proof of the Slepian-Duguid theorem,’ the basic 
combinatory theorem of P. Hall on distinct representatives of subsets 
is used repeatedly. This means (roughly) that the proof proceeds by 
showing that an arbitrary permutation (to be realized in the network) 
can be decomposed into submaps each of which can be realized in a dis- 
joint part of the network, thereby not interfering with the realization 
of the other submaps. A significant departure from Ref. 3 is that we try 
to obtain rearrangeability directly from conditions that are stated for 
the network as a whole, as well as by building it up from rearrangeable 
subnetworks. 

The following intuitive guides should be useful in understanding 
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Theorem 2. The permutations ¢;, -:- , ¢s-1 are of course intended to be 
those corresponding to the link patterns between the stages of a net- 
work. The partition I, corresponds to the assignment of the terminals 
entering the kth stage to various square switches, all wu e A for A ¢ II, 
being on the same switch. The partitions 1" are used in defining the 
submaps mentioned above. 

The ‘‘covering”’ properties (727) and (2v) of the g, in Theorem 2 ensure 
(roughly) that the g are sufficiently mixing or distributive to be able 
to generate all permutations in the restricted ways permitted in the 
definition of K. They are generalizations of the property, exhibited in 
Fig. 2, that every middle switch is connected to every side switch by a 
link. The property (v), finally, implies that various sets of switches all 
have the same cardinality; this ensures (again, roughly) that if a cross- 
point is not being used for a connection between one inlet-outlet pair, 
then it can be used for a connection between some other pair. 


Proof of Theorem 2: We use induction on odd s = 3. If s = 38, there is 
only one II”, viz. Il’. Let g be a permutation; we show that 


Pe HT iil. poll 3. 


The argument to be given is constructive, in that we do not use proof by 
contradiction, but actually give a kind of recipe for finding three per- 
mutations 7; « H;,7 = 1, 2, 3, with 

P= MP1NW273 - 


To prove the theorem for s = 38, it is enough for z = 1, 2, 3 to exhibit 
a partition II(z) and to define y; on A ¢ II(2), i.e., to give 


ANi 5 A eII(2), 4 ee 
Condition (v) fork = 1 (=4(s — 1) here) tells us that for A ¢ I 
[| =|T| =| Al. 


However, the ‘‘middle-stage’”’ condition (27) states that I, = II. 
Hence | 11, | - | Hz] = N. Since [condition (727) now] gr * covers TI; from 
a(t’) = gg, (Il’) = Ik, it follows that B ¢ TM, implies | B| = | Ih |. 
If for some B ¢ Il; it was true that | B| < | Ik|, then 


> |B) <|th]-|m)=N 
Belly 
which is impossible. Thus | B| = | Il. | for B ¢ I, . In exactly the same 


way, using condition (7v), we find that C e¢ Tl; implies |C | = | I]. 
Therefore B, C ¢ I; U Il; implies | B| = | C |. 
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Returning now to the chosen permutation ¢g, we apply Lemma 3 to 
conclude that there exists a partition Il, of X such that ¢ covers II; from 
Il,,¢ covers Ih from g(I,), | U,| = | A | for A eT, UT, and| BI = 
| 1,| = | Is] for B e I,. Hence also | 1,| = | I |. 

Let uw: Il, © Ty be any map of T, onto IIz. The desired partitions 
II(z), 7 = 1, 2, 3 will be taken to be 


II(1) — gi (IIe) 


1(2) = {u(D): De T1,} = Me 
11(3) = I, 
and the desired permutations 7;, 7 = 1, 2, 3, are defined so as to 


have these properties: for D ¢ I, 
ns: 9(D) — 2 '(u(D)) 
m :¢i(u(D)) > D 
no: g2mx(D) orm ‘¢(D). 


That this can be done (uniquely, indeed) can be seen as follows: Let 
D¢T,, and p(D) = Be Ik. Since ¢ ' covers I from g(II,), and ¢ 
covers II; from II, , it must be true that 

(1) ¢ hits I, from ¢(D) 

(2) ¢ hits II; from D. 

But at the same time, by conditions (777) and (zv), and the fact that 
I’ = Il, ¢, covers Il; from Il, and g covers IIs from II, , and so 

(3) ¢, hits Ty from B 

(4) ge hits Oi; from B. 

Thus if we Df) A and A eI, there is a unique v e A such that ¢g2(v) « B, 
and we take y3(w) = v. Similarly, if 2 = o(u) eC and C eIl,, there is a 
unique w eC such that 9; '(w) ¢ B, and we take m(w) = z. Finally, de- 
fine m2 so that n2(¢g2(v)) = g: '(w). Since u(-) is onto, each D e Il, 
deals with a unique B = yu(D) ¢ I, and the definition of 7; ,2 = 1, 2, 3 
can be made for each D and its associated B, independently of the 
others. It is apparent that 


neimems(D) = 9(D), Dell, 
or 
P= MPINW273 - 
Since 7; forz = 1, 2, 3 is onto, and is a subset of 


UEXE, 


Eell; 
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it follows that n; « H;,7 = 1, 2, 3, and thus that 
Kk = S(X). 


We now assume, as an hypothesis of induction, that the theorem is 
true for a given odd s — 2 = 3, and that we are given permutations ¢, , 
and partitions {II,} and {IL"}, satisfying the conditions of the theorem. 

No loss of generality is sustained if it is assumed that each A e II” 


is invariant under g, --- , ¢s_2. This invariance can always be achieved 
by redefining the g;, without loss of properties (727) to (v). It can now 
be seen that for k = 2, --- , s — 2 the restrictions 

all; ; all", with A € Ir’, 


satisfy all the conditions on II, , 1” (respectively) used in Theorem 2. 
Hence by the hypothesis of induction, for each A ¢ II’, the restriction of 
the complex 


ope Te so 51 


to A generates S(A). The argument used for the case s = 3 can now be 
used to complete the induction, ¥’(= ¥*” here) playing the role of II: . 


X. CONSTRUCTION OF A CLASS OF REARRANGEABLE NETWORKS 
We consider a network v built of an odd number s 2 38 of stages, 
vp = Sig, +++ Gs—18s-1 , 


satisfying the symmetry conditions 


Gr = Osh 
k= Bae , 2(8 = 1), 
Sz = S$3—r41 


with each stage §, made of identical square switches. The ¢, will be 
chosen in the following way: order the switches of each stage; to define 
gx for a given 1 S k S 3(s — 1) take the first switch of 8; , say with n 
outlets and n a divisor of N, and connect these outlets one to each of 
the first n switches of 8,4; ; go on to the second switch of §, and connect 
its n outlets one to each of the next n switches of 8.41 ; when all the 
switches of 8,4: have one link on the inlet side, start again with the first 
switch; proceed cyclically in this way till all the outlets of $, are as- 
signed. (See Fig. 3.) 

We shall show that a network v constructed in this way is always 
rearrangeable. 


Theorem 8: Lets = 3 be an odd integer. Let nm, ,k = 1, --- , (s + 1)/2, be 


any positive integers such that 
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Sk x Sk+4 


Nk INLETS 





Fig. 3 — Assignment of inlets and outlets of rearrangeable network. 


1(s+1) 


J] n:=N and n;= 2. 
=I 


For eachk = 1, +++ , (s + 1)/2, let Il, be the partition of X = {1,--- ,N} 
into the N/n, sets of the form 
Axi = {t: (4 — 1)m, < t S in,} a=1,---,N/nm. 

Let go ,k =1,--- , (s — 1)/2, be permutations with the property thatn = t 
(mod N/nx41) tf and only af 

gn (n) € Angie t= 0, ---, (N/meqi) — 1. 
Define 

Oe =~ for (s—1)/2<kSs—1. 
Let 8 ,k = 1, --- , s, be the stages made of square switches defined by 

Sie = Ss_e41 k=1,---,(s—1)/2 


S& = UAXA 


AeT, 
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and v be the network constructed by putting a link pattern corresponding to 
gn,k =1,---,8 — 1 between stages $;, and 8,41 . Then v is rearrangeable. 


Proof: It is readily seen that fork = 1, ---,s 
P(8,) = the largest strictly imprimitive subgroup with sets of im- 
primitivity II; , 


= H,, 
in the notation of Theorem 2. Thus to prove the theorem by appeal to 
Theorem 2 it is enough to exhibit suitable partitions II", k = 1, --- , 


4(s — 1), and to show that these, together with g,, --- , ¢.1, satisfy 
the conditions of Theorem 2. For k = 1, --- , $(s — 1), set 


u* = class of all {7: (¢ — 1)N/m. --- ne <j S iN/m +++ me, 
¢=1,-+-, mma: mt. 

It is evident that the Il" are successively finer partitions, i.e., that 

1 2 k= 1,---,4#(s — 8). 
Also, since Il?“ consists of the nynz +++ Nye—1 sets 

(i: (& — L)nyety <7 S knyepy , b= 1, +++, mime +++ Maye} 

it can be seen that 

We? = (Ayeaietl ee < Nein}: 
and hence that the middle stage condition (27) in Theorem 2, 

Ty(spa) = nm, 


is satisfied. 
The remainder of the proof, in which the requisite covering properties 
of the g, are demonstrated, is based on some auxiliary results. 


Lemma 8: For k = 2,---,4(s —1),and1 Si S N/nynny: the following 
identity holds 


—1 
U Ax.,t = Yk ( U Arat,t) 
t=i-1 (estes 
nk 


N 
(mod ae 
NENK+1 


and the sets on the right are disjoint for different 7. 


Proof: Since 
ve (Agus) = {nen =t (mod N/m), 1Sns MN} 


the union on the right in the lemma is the set of all n that are =¢ (mod 
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N/nx41) for some t with (¢ — 1)n, < t S in, . Consider such an n, with 
say 





. t 
at Soe 
oe IN it t 4 1 < Nn =>2 
Net Ol < muy. 
Then 
nee IN + (¢—-—1)m4+ u, 
NENA 


with 0 < u S m, and so 


ne Ax, (in (ngngy ti-D) 
or 
neApzs, with ¢ = (4 — 1) (mod N/ngng41). 


Since the representation of n in terms in 7 and w is unique, the lemma 
follows. 

The practical or physical import of the lemma is this: In any stage 
k+1,1s k S 3(s — 1), the zth block of n;, switches is connected by 
the link pattern ¢; to exactly those nz; switches (in the kth stage) whose 
number ¢ = (4 — 1) (mod N/nzNnn41). 


Definition: For 1 SiS n,and2 Sk Sm = 3(s+1) 


By = U {Ae t = r(mod nynz «++ m_1) for some r with (7 — 1) 
t 


IA 
2. 


NeN3 +++ Near 
where non3 --- Ne 18 taken = Lif k = 
Lemma 6: For i StS nand2S5k<m 
—1 
Bu = ge (Bix4t). 


Proof: We show that the right-hand side contains the left. Equality then 
follows from Lemma 5, since B;,.4; will always be a union of sets of the 
form 


U Arsir: 


j-1<_1 <j 
nk 
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(r + 1) (mod w), 


1 congruent to an 


This is because t = r (mod wu) if and only if ¢ + 1 


if0 < r < wu. Consider then ann e By . There is at 
r (mod nye +++ Ne-1), With 


Vl 


v= E 


NenNs3 ° 


IIA 


a, 
7 N-1 


such that n ¢ Ax... The latter fact implies that 
(¢{-l)hu<n 8 in. 


Now ¢x(n) € Axyt,y, Where n is congruent to 7 (mod N/nx41), So we can 
represent n in the form 
Hence 


(t — 1)m < an, a 7S tn. 
Nk 
Writing ¢ = Inynz +++ mea + 7, With 


(¢ — 1)nen3-+- mia<rs 


= MN oe Nk-1 3 
we see that 
qm M+ (r— lm <7 S gm-s mt rm 
where 
aN 
qz=l ; 
MNe +++ Nhy 

It follows that 7 is congruent (mod nynz 
region 


n:) to some integer p in the 


(1 — 1) nz - * Nk <p Sim--+ nm, 

and thus ¢x() ¢ Bix.41, completing the proof of Lemma 6. 
Now if k = m, the defining condition that t = r (mod n 

some r with 


-* Nm—1) for 


* Mm-1 <r Sin. -> Nm—1 4 


used in the definition of By, , can be put into a slightly different form. 
In this case we must have 


1st S N/m = mne--+ M1 
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and so ¢ can only be congruent to r by being equal to 7, that is 
(4 — 1)me +++ Mma <t S ine +++ Mma. 
Hence it can be seen that 
{Bim,t = 1,°+:,m} = IV. 
Applying Lemma 6 (m — 2) times we find that 


{By , 2 = 1, eae , mi} = yg rear Om (II’) 


= ¢(¥’'). 
Now n = ¢ (mod N/ne) if and only if gi(m) ¢€ Ae, £ = 1,--- , N/ne. 
Also, by definition of Biz , 
Bi = U Ant. 


t=i(mod nj) 


Let gi(n) ¢ Be, gi(n) € Ax. Then n has the form 


n= cae ye eee (v4 bmi 
NyNo NyN2 

son = 7 (mod m). Since | Be | = | Bim | = ne +++ Mm = N/nz, it follows 
that By» is the g; image of N/n, integers each of which is congruent to 7 
(mod n). Since each such integer must be in a different Ay, , it follows 
that 9,’ covers II, from ¢,(W'). 

The remaining conditions in Theorem 3 can be demonstrated in es- 
sentially the same way; one has merely to identify the sets in question, 
and use Lemma 5 and an analog of Lemma 6. The details will be omitted. 
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Optimal Rearrangeable Multistage 
Connecting Networks 


By V. E. BENES 
(Manuscript received March 12, 1964) 


A rearrangeable connecting network is one whose permitted states rea- 
lize every assignment of inlets to outlets—that is, one in which tt ts possible 
to rearrange existing calls so as to put in any new call. In the effort to 
provide adequate telephone service with efficient networks it is of interest 
to be able to select rearrangeable networks (from suitable classes) having a 
minimum number of crosspoints. This problem is fully resolved for the 
class of connecting networks built of stages of identical square switches 
arranged symmetrically around a center stage: roughly, the optimal net- 
work should have as many stages as possible, with switches that are as small 
as possible, the largest switches being in the center stage; the cost (in cross- 
points per inlet) of an optimal network of N inlets and N outlets is nearly 
twice the sum of the prime divisors of N, while the number of its stages is 
2x — 1, where x is the number of prime divisors of N, in each case counted 
according to their multiplinty. By using a large number of stages, these 
designs achieve a far greater combinatorial efficiency than has been attained 
heretofore. 


I. INTRODUCTION 


A study of rearrangeable connecting networks, begun in a previous 
paper,! is here continued; the object of the present work is to solve the 
synthesis problem of choosing, from a class of networks that are built 
of stages of square switches and satisfy some reasonable conditions on 
uniformity of switch size, a rearrangeable connecting network having a 
minimum number of crosspoints. Some of the terminology, notation, 
and results of Ref. 1 are used, and familiarity with it will be assumed 
from Section IV on. 

Naturally, we do not pretend that minimizing the number of cross- 
points (used to achieve a given end) is the only consideration relevant 
to the design of a connecting network. Other factors, like the number 


1641 
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of memory elements, the amount and placing of terminal equipment, 
the ease with which a network is controlled (e.g., the possibility of 
reliable end-marking), etc., may be of overriding significance, depend- 
ing on the technology used. Still, it is important to know the limits of 
the region of possible designs, and these are obtained by optimizing on 
one variable without attention to others. 

The problem of designing a good rearrangeable network was (prob- 
ably first) considered in a paper of C. E. Shannon? investigating memory 
requirements in a telephone exchange. On the networks that he con- 
sidered he imposed the realistic “‘separate memory condition” to the 
effect that in operation a separate part of the memory can be assigned 
to each call in progress. This means that completion of a new call or 
termination of an old call will not disturb the state of memory elements 
associated with any call in progress. Shannon showed that under this 
assumption a two-sided rearrangeable network, with N inlets and N 
outlets, and N a power of 2, requires at least 


2N loge N 


memory elements (e.g., relays). He gave a design which actually real- 
ized this lower bound using 


4(2” — 1)log. N 


crosspoints (e.g., relay contacts). His design had the disadvantage of 
having very large numbers of contacts on certain relays. It is to be 
noted that Shannon was concerned with minimizing the number of 
memory elements, without regard to the number of crosspoints. 

Shannon’s separate memory condition is actually met by modern 
connecting networks that are of current practical interest, viz., by the 
networks made of stages of crossbar switches, considered here. For in- 
deed, an inlet relay on ann X n crossbar switch is used to close any and 
each of n crosspoints: the exact one that closes depends on what outlet 
relay is simultaneously activated. 

In this paper we consider the problem of minimizing the number of 
crosspoints in a network built of square switches, without attention to 
the number of relays. The following result (a consequence of Theorem 
8) then complements Shannon’s: For N a power of 2 it is possible to 
design a rearrangeable network with N inlets and N outlets using 4N 
log, N — 6N relays and 4N (log. N — 2) crosspoints. The figure for 
relays is roughly twice Shannon’s while that for crosspoints is much 
smaller than his, for N large. In our design, no relay controls more 
than 4 contacts. 
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II. SUMMARY AND DISCUSSION 


In Section III we discuss the notion of the combinatory power or 
efficiency of a connecting network, and propose to define it as the frac- 
tion r of permutations it can realize. According to this definition the 
four-stage No. 5 crossbar type of network with 10 X 10 switches has 
efficiency r close to zero, although it turns out that for the same number 
(1000) of terminals there are networks that achieve r = 1 with a 
smaller number of crosspoints.! This greater efficiency is obtained by 
using many more stages than four. 

Preliminaries are treated in Section IV. Particular attention is drawn 
to the class Cy of all two-sided networks having N inlets and N outlets, 
and built of stages of identical square switches symmetrically arranged 
around a center stage. The cost c(v) of such a network » is defined as the 
total number of crosspoints, divided by N. It is proposed to select rear- 
rangeable networks v from Cy that have minimal cost c(v). This problem 
is attacked in Section V by defining (7) a map T' from Cy to a special set 
A such that c(v) is a function of T(v) ¢ A, and (7) a partial ordering 
of A. It is then shown (Section VI) that (roughly) a network v is optimal 
if and only if 7(») is at the bottom of the partial ordering of A. This 
result allows one to identify (Section VII) the optimal networks in 
Cy. Their general characteristics are these: Except in some easily 
enumerated cases, the optimal network should have as many stages as 
possible, and switches that are as small as possible, the largest switches 
being in the middle stage; the cost c(v) of an optimal network » is 
very nearly twice the sum of the prime divisors of N, while the number 
of its stages is 2x — 1, where x is the number of prime divisors of N. 

Our chief conclusion is that by using many stages of small switches 
it is possible to design networks that are rearrangeable and cost less 
(in crosspoints per terminal) than networks in current use, which are 
far from being rearrangeable. The price paid for this great increase in 
combinatory power is the current difficulty of controlling networks of 
many stages. This difficulty is technological, though, and will decrease 
as improved circuits are developed. 


III. THE COMBINATORY POWER OF A NETWORK 


A principal reason why rearrangeable networks are of practical in- 
terest is (of course) that they can be operated as nonblocking networks. 
If the control unit of the connecting system using the rearrangeable 
network is made complex enough, it is in principle possible to rearrange 
calls in progress, repeatedly, in such a way that no call is ever blocked. 
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At present this possibility is being exploited in only a few special-pur- 
pose systems, because of the large amount of searching and data-proc- 
essing it requires. 

However, there is another reason why rearrangeable networks should 
evoke current interest. Even if we do not care to exploit it, the property 
of rearrangeability in a connecting network is an indication of its com- 
binatory power or reach, and so can be used as a qualitative “figure of 
merit’”’ for comparing networks. Other things being equal, a rearrange- 
able network is better than one which cannot realize all assignments of 
inlets to outlets. Rearrangeability expresses to some extent the efficiency 
with which crosspoints have been utilized in designing a connecting 
network for distribution, that is, for reaching many outlets from inlets. 

If a numerical measure is called for, one can use the fraction of real- 
izable maximal assignments. For a network »v with the same number 
N of inlets as outlets, and with inlets disjoint from outlets, this is just 


_ number of permutations realizable by » 


7 N! 
= combinatorial power of ». 


It is apparent that 0 S r S 1, and that for a rearrangeable network 
r = 1. Also, r may be viewed as the chance that a permutation chosen 
at random will be realizable. 

We shall calculate a bound on the combinatorial power r of the kind 
of connecting network most commonly found in modern telephone 
central offices. This is the network illustrated in Fig. 1. We choose the 
switch size n = 10 as a representative value; the network then has 
N = 1000 inlets, as many outlets, and 4 X 10° crosspoints. Clearly, 
the network can realize at most all the permutations that take exactly 
n terminals from each frame on the inlet side into each frame on the 
outlet side. Now a frame has n’ inlets (outlets), and there are 


n! 


(nly 


ways of partitioning n’ things into n groups of n each. Since there are 


2n frames, there are 
( n’! , 
(n!)” 


ways of choosing n groups of n each on each frame, and assigning inlet 
groups to outlet groups (one-to-one and onto) in such a way that for 
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- CROSSBAR 
/ SWITCH 


CUSTOMERS 
LINES TRUNKS 





aap eae A Oat DH) _! Kcr ae ee eee I 
LINE LINK FRAMES TRUNK LINK FRAMES 


Fig. 1 — Structure of No. 5 crossbar network. 


every inlet frame and every outlet frame exactly one group on the in- 
let frame is assigned to a group on the outlet frame. There are n” groups 
on a side (inlet or outlet), and within each group (at most) n! permu- 
tations can be made, 1.e., each inlet group can be mapped, terminal by 
terminal, in at most n! ways onto its assigned outlet group. Hence at 


most 
(eey (n!)™ 


permutations can be realized. There are N = n° terminals on a side, 
and a total of n°! possible permutations in all. Thus 


(n?!)2" 
3) bi 





r s yn? 
nin 


For n = 10, with 
20 log (100!) = 3159.4000 
100 log (10!) = 655.976 
3 log 27 = 0.89959 
log (a!) ~ 4 log 27 + (a + 3) logx — x logwne 
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we find roughly 
r<10™. 


Thus only a vanishingly small fraction of all possible permutations 
can actually be achieved by the No. 5 crossbar network (illustrated in 
Fig. 1) for m = 10, a reasonable switch size. 

In the example calculated, the network has a ‘‘cost”’ of 40 crosspoints 
per terminal on a side. Much of the force of the example would be lost 
if it were in fact impossible to achieve high values of r (i.e., near 1) 
without incurring a great increase in the cost in crosspoints per termi- 
nal. This, however, is not the case. It follows from our Theorem 8 that 
a rearrangeable network (r = 1) can be designed for NV = 1024 terminals 
on a side using only 


4(log, N — 2) = 32 


crosspoints per terminal. Thus it is actually possible to achieve r = 1 
for more than 1000 lines with fewer than 40 crosspoints per line. The 
network that does this turns out to have 17 stages instead of 4, an 
illustration of the way that allowing many stages can lead to vastly 
more combinatorially efficient network designs. The middle stage of 
this network consists of a column of 256 4 X 4 switches, and each of 
the other 16 stages, arranged symmetrically, consists of a column of 
512 2 X 2 switches. For k = 1, --- , 8, the kth stage is connected to 
the (k + 1)th as follows: the first outlet of the first switch of stage k 
goes to the first switch of stage (k + 1), the second outlet of the first 
switch of stage k goes to the second switch of stage (k + 1), the first 
outlet of the second switch of stage k goes to the third switch of stage 
(k + 1), ete., as in Fig. 2 with 1 S k S 7; when each switch of stage 
(k + 1) has 1 link on it the process starts over again with the first 
switch, and continues cyclically until all the links from stage k are 
assigned. The connections between stages k andk + 1 fork = 9,---,17 
are the inverses of those for k = 1, --- , 8, so that the network is sym- 
metric about the middle stage.’ 


IV. PRELIMINARIES 


The symbol Cy , N = 2, is used to denote the class of all connecting 
networks v with the following properties:* | 

(1) vis two-sided, with N terminals on each side . 

(2) v is built of an odd number s of stages 8, k = 1, ---, 8, of 


* Familiarity with Ref. 1 is assumed henceforth. 
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STAGE k STAGE K+1 





Fig. 2— Link assignment. 


square switches, i.e., there are permutations yg; , +--+ , gs-1 such that 
vp = SigiS2, +°* 5 Psi8s 
(3) vis symmetric in the sense that 
Sx = Ss—n4i for k=1,:---,$(s -—1) 


(4) With the notation 


s = s(v) = number of stages of »v 


nN, = n,(v) = switch size in the kth stage of », 


vy has N/n; identical switches in stage k, i.e., each stage S, is of 
the form 


U AXA 


Aell 


for some partition II with | A | = | B| for all A,B eI. 
The defining conditions of Cy imply that 


Ny = Ns—h4s for k=1,---,(s —1)/2 


and that 
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(s+1) 


N. 


Nk 
k=1 


It is assumed throughout that n,(v) = 2 for all y and all k = 1, ---, 
s(v). 

The cost per terminal (on a side) c(v) of a network »v e Cy is defined 
to be the total number of crosspoints of » divided by the number N 
of terminals on a side. Since there are N/n;, nx X nz switches in stage k, 
the total number of crosspoints is (using the symmetry condition) 


>» (N/nz) «ne = NY, Nk 
k=1 k=1 


$(s—1) 
N (soso 2 pa ms) 


I 


and so 
4(s—1) 
mys) $2 Dy re. 


c(v) 


I 


A network » is called optimal if 
c(v) = min {c(yn): u € Cy}. 


It is clear that the cost per terminal of a network v ¢ Cy depends 
only on the switch sizes, and not at all on the permutations that define 
the link patterns between stages. 

Also, it is apparent from Theorem 3 of Ref. 1 that given any network 
y, € Cy there is another network v2 ¢ Cy that is rearrangeable and differs 
from » only in the fixed permutations that are used to connect the stages; 
in particular, v, and ». have the same number of crosspoints. Thus the 
problem of selecting an optimal rearrangeable network from C‘y is equiv- 
alent to that of choosing an optimal network from Cy , rearrangeable 
or not. A network in Cy can be made rearrangeable by changing its 
link patterns at no increase in cost. 


We make 


Definition 1: m= m(v) = [s(v) + 1]/2 = numerical index of 
the middle stage 
n = n(v) = Nn = size of middle stage switches 


Definition 2: O(v) = {m, +--+, Mm} = the set of switch sizes (with 
repetitions) in outer (i.e., nonmiddle) stages 


Definition 3: w=(N) = {O(r): v € Cy}. 
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Remark 1: c(v) = n(v) +2 >> a. 


ze0(v) 
Theorem 1: Let (A,n) be a point (element) of 
w(N) XK X 
with 
nIly=N. 


yea 


Then there exists a nonempty set Y C Cy such that 
T(v) = (A,n), ve Y, 


The v’s in Y differ only in the permutations between the stages and in 
the placing of the outer stages, and at least one of them is rearrangeable. 
This result follows from the definition of Cy and from Theorem 2 of 
Ref. 1. 


V. CONSTRUCTION OF THE BASIC PARTIAL ORDERING 


The solution to the problem of synthesizing an optimal rearrangeable 
network from Cy will be accomplished as follows: we shall define a 
mapping T of Cy into w(N) X X, with X = {1, --- , N}, and a partial 
ordering < of T(Cy); the map 7’ will have the property that c(v) is a 
function of 7'(v); then we shall prove that (roughly speaking) a net- 
work » is optimal if and only if 7(yv) is at the “bottom” of the partial 
ordering, i.e., that c(v) is almost an isotone function of T(v). 

To define a partial ordering of a finite set, it is enough to specify 
consistently which elements cover which others. Let Z,20,Z1, --- be 
sets of positive integers <N possibly containing repetitions. 


Definition 4: Z, covers Z,: if and only if there are positive integers 7 
and k such that k occurs in Z; , 7 divides k, and Z» is obtained from 7; 
by replacing an occurrence of & with one occurrence each of j and k/j. 


Definition 5: Z) S Z if and only if there is an integer n and sets 2, , 
Z2,°°* 5 2n such that Z;4, covers Z;,7 = 0,1,-°--,n—IlandZ, = Z. 


Definition 6: T: » > O(r), n(»). 
A partial ordering S of T(Cy) is defined by the following definition of 
covering: 


Definition 7: Let », v be elements of C(NV). 
T(u) covers T(r) if and only if either 


1650 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1964 


(7) n(v) < n(p), n(v) divides n(u), and O(v) results from O(,) 
by adding an occurrence of n(u)/n(v), or 
(27) n(v) = n(u) and O(u) covers O(r). 


VI. COST IS NEARLY ISOTONE ON T(Cy) 
Theorem 2: If T(v) S T(u), and n(n) > 6, then 
c(v) S c(p). 


Proof: It is enough to prove the result for » and »y such that T() covers 
T(r). 


Case (1): n(v) < n(n), n(v) divides n(n), n(v) 2 2, and O(yv) results 
from O(u) by adding an occurrence of n(n) /n(r). Then 


c(vy) = n(vy) +2 DS 2 


zeO(v) 


= nv) +*™ 420 2 
ny ) zeO(u) 














6g) Gye ae) 
7 Py — mad] og 2a) 
satay) E 2 | 4 Pate) 


Thus c(v) S c(yz) if and only if 


n(n) 2n(u) 
my) (1 7 we) Gay 








that is, if 
2y 
eee | 
where « = n(v) and y = n(u)/n(v). Now n(n) > 6 implies that either 


IIA 


xv 


(4) nv) = 2 and 7") > 4 
n(v) 


or 


Gy Myre sae ss 
n(v 


or 


(111) n(v) = 3 
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The condition 2y/(y — 1) S «x is fulfilled in all three cases, and so 
c(v) S c(p). : 
Case (2): n(u) = n(v) and O(n) covers O(v). There exist integers 
j,k such that j divides k, 7 = 2 in O(u), and O(y) results from O(,) 
by replacing one occurrence of & with one each of j and k/j. Then 


c(v) = n(vy) +2 dz 


zeO(v) 


n(u) — 2k + 25 + (2h/j) + 2' a x 


I 


c(u) — 2k + 27 + (2k/9). 
Since j divides k andj = 2,k = 2j and k = 2k/j, so 


_k ee 
k = 2max (3,4) > + = 
de : J 
and c(v) < c(p). 


Theorem 38: If v « Cy and O(v) does not consist entirely of prime numbers 
(possibly repeated), then there exists a network pw in Cy of s(v) + 2 stages 
with c(u) < c(v), and v cannot be optimal in Cy . 


Proof: There is a value of k in the range 1 S k S n(v) — 1 for which 
nN is not a prime, say n, = ab. Define stages 8;(u), 7 = 1, --- , s(v) + 2 
as follows: 


Siu(u) = 8%), gukt+i,---,n(y); 
let Il, , Il, be partitions of X = {1, --- , N} with 


[la] = N/a and AceI,=|Al|=a 
[Il,.| = N/b and Bell, > |B| = 0. 
Set 
Sryi(u) = ae 
S.(u) = U B 
Belly 
Si(u) = 8,(0) jolene, k-1 
Si(u) = Sorsui(e) allj = 1, «++, 8(v) +2 
By Theorem 2 of Ref. 1 permutations g,, - ++ , gs()-1 can be found so that 


the network 


M = $191, °° * 5 Ps(u)—1Ss(u) 
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is in Cy and is rearrangeable. It is apparent that nny = Amv) and that 
O(v) covers O(u). Hence the argument for case (77) of Theorem 2 shows 
that » has strictly lower cost than ». 


Corollary 1: If N > 6 and is not prime, then a network v consisting of one 
square switch 1s not optimal. 


VII. PRINCIPAL RESULTS 


Definition 8: An element Tv) of T(Cy) is ultimate if there are no 
ue Cy such that 7T(v) covers T(z). 


Remark 2: T(v) is ultimate if and only if n(v) is prime and O(v) con- 
sists entirely of prime numbers. 


Definition 9: An element 7'(v) of T(Cy) is penultimate if it covers an 
ultimate element. 


Definition 10: p, ,n = 1, 2, --- , is the nth prime. 


Definition 11: r(n) is the prime decomposition of n, that is, the set 
of numbers (with repetitions) such that 


al 


N= pr pr”? +++ pi 
if and only if r(n) contains exactly a; occurrences of p;,7 = 1, ---, 
l, and nothing else. 


Definition 12: p is the largest prime factor of NV. 


Lemma 1: If p = 8 and N > 6 is even, then the following conditions are 
equivalent: 
(1) v ts optimal 
(11) T(r) ts penultimate and n(v) = 6 or 4 
(vt) T(v) = (a(N/)6,)6 or (7(N/4),4). 


Proof: By Theorems 2, 3 only »v with n(v) S 6 and O(yv) consisting 
entirely of primes can be optimal. Writing N = 2°38’ with x 2 1 and 
y = 1, it is seen that such vy must have a cost c(v) having one of the 
forms 


2+ 2[2(4 — 1) + 3y] = 4 + by — 2, 
3 + 2[2x + 3(y — 1)] = 4a + by — 3, 
4 + 2[2(4 — 2) + 3y] = 42 + by — 4 
(only occurs if x > 1), 


6 + 2[2(2 — 1) + 38(y — 1)] = 4a 4+ by — 4. 
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The least of these is either of the last two, which correspond to n(v) = 
6if# = lorton(y) = 6Gor4ifz > 1. It is apparent that (27) is equiv- 
alent to (22). 
Lemma 2: If p = 2, and N > 4, then the following conditions are equiva- 
lent: 
(2) v ts optimal 

(it) T(v) ts penultimate and n(v) = 4 

(wt) T(v) = (a(N/4), 4). 
Proof: With N = 2” it can be seen as in Lemma 1 that only those v 
can be optimal whose cost c(v) has one of the forms 


4 + 2/2(¢ — 2)]. 
The second of these is the better, and corresponds to n(v) = 4. 


Theorem 4: Let p be a network such that a prime number r > n() occurs 
in O(n). Let M result from O(u) by replacing one occurrence of r by n(n). 
Then for any network v with 


T(v) = (Mr) 
at ts true that 
c(v) < e(u) 


1.€., v 1s strictly better than p. Among such v, that is best for which r is 
largest. 


Proof: Existence of a rearrangeable v satisfying T(v) = (M,r) is guar- 
anteed by Theorem 1. For the rest of the proof, we observe that r > n(u) 
and 


niu) +2 di 2 


zeO (ph) 


c(u) 


n(u) + 2r — 2n(u) +2 Dd 2x 


zeM 


r— n(n) + e(y): 


Theorem 5: If n(w) S 6, n(w) = 2°38%5’, some prime number r > 3 oc- 
curs in O(u), and if M results from O(n) by replacing one occurrence of 
r by x occurrences of 2, y occurrences of 3, and z occurrences of 5 then for 
any network v e Cy with 


T(v) = (M,r) 
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at is true that 


c(v) S c(z) 


1.e., v is at least as good as uw. Among such v, that is best for which r is 
largest. 


Proof: Existence of a rearrangeable v ¢« Cy satisfying T(v) = (M,r) is 
given by Theorem 1. For the rest of the proof, we observe that r 2 5 
and 


nu) +2 Dd) u 


ueO (py) 


n(n) + 2r — 4x — Gy — 102 +2 >> u 


ueM 


c(m) 


r+n(u) — 4c — Gy — 102 + c(r). 


Since x, y, and z can only assume the values 0 and 1, with z = 1 
if and only if « = y = 0, we have c(p) = c(v), the best v corresponding 
to the largest r. 


Definition 18: Q = {(A,r):ra prime and A = r(N/r)}. 
Definition 14: L = 77'(Q). 


Remark 2: Q consists of all the absolute minima in the partial ordering S$ 
of T(Cy), i.e., v « L implies that there are no u ¢€ Cy for which 


ne T(u) < T(r). 
Theorem 6: If p > 3, then all optimal networks belong to L. 


Proof: Let » ¢ Cy — L be given. We show that there exists a v e L that 
is at least as good. 


Case 1: There is a sequence w = 1, fe, °°* , Wn, Vv With, + », ve L, 
Nun) > 6, 


T(m) 2 T(w2) 2 +--+ = Tun) 


and such that 7'(u,) covers T(v). Then the numbers n(u;), 7 = 1, 
- ,nare all > 6, and the result follows from Theorem 2. 


Case 2: All sequences up = w1, 62, °°, un, Vy Withp, ~ v,veL, T(m) = 
T(u2) 2 ++: 2 T(u,), and such that 7(y,) covers T(v), are such that 
(un) < 6. Consider such a sequence. Let 7 be the smallest index 7 
for which n(u;) S 6,7 = 1, --- ,n. Then Theorem 2 gives c(u) = c(us). 
Since n(u;) S 6 and T(u,) covers T(r), it follows that O(u;) contains 
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an occurrence of p > 3. Hence by Theorem 5 there exists a network 
ne Cy with n(n) = p and 


Cn) Ss cay) = cla): 
Let & « L be such that n(é) = p and T(m) covers T(£). Then c(é) S 
c(n) by case (77) of Theorem 2. Hence 
c(é) S c(u) 
EeL. 


Theorem 7: If N S 6 and vis optimal, then vis a square switch and c(v) = 


N. 
Proof: For prime N with 2 S N < 6 the result is obvious. If VN = 6 


and v e Cs then exactly one of the following alternatives obtains: 
T(v) = (6,6) and c(v) = 6 
T(v) = ({3},2) and cy) = 8 
Py = Qot. 3) and -clpy = 7: 


The first alternative is optimal, and there is exactly one v e Cs such 
that T(v) = (6, 6), viz., the 6 X 6 square switch. Similarly, if N = 4 
and vy e (,, then T(v) = (6, 4) or ({2}, 2); the former has cost 4, the 
latter 6. 


I 


Definition 15: For n = 2, D(n) is the sum of the prime divisors of n 
counted according to their multiplicity; thus if 


n — DA1Q%2 eee ae” 
then 


D(n) = Dy Pits = 2 wv. 


zemw(n) 


Definition 16: c(N) = min {c(v): v e Cy}. 
Theorem 8: 


NiiN $6 or N ts prime 
c(N) =p + 2D(N/p) if N > 6 and either p > 3 or N is odd 
2D(N/2) af N > 6 in all other cases (1.e., p = 2, or 
p = 3and N 1s even). 


Proof: Putting together Lemmas 1, 2 and Theorems 1, 2, 3, 4, 6, and 7 


1656 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1964 


we obtain the following values for the minimal cost in crosspoints per 
terminal on a side for networks in Cy: 


N if N S 6 or N is prime 


p+2 >) ifp >3,N>6 
zemw(N/ p) 
6+2 > «=2 Dd cz uf p = 3, N > 6, N even 
vem (N/6) zew(N/2) 
c(N) =<{<83+2 >) x =3+4 6(log; N — 1) 
wew (N/3) 


if p = 3, N > 6, N odd 
4+2 >) x =4(lonN—2)=2>5 a 


wet (N/ 4) xem (/2) 
ifp = 2,N > 6; 


simplification gives Theorem 8. 
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Attitude Determination and Prediction 


of Spin-Stabilized Satellites 


By L. C. THOMAS and J. O. CAPPELLARI 
(Manuscript received February 17, 1964) 


Techniques for both attitude determination and prediction for spin- 
stabilized satellites are developed. Their use is demonstrated using Telstar 
I and ITI satellite data. It ts shown that an inclined dipole model of the 
earth’s magnetic field and the method of averaging the gravitational and 
magnetic torques over each anomalistic period of the satellite permits attitude 
predictions to within a few tenths of a degree of determined values in most 
instances. In those few cases where departures are above one degree, explana- 
tions are presented to show the reason for such discrepancies. 

The usefulness of combining optical flash and solar sensor data for 
attitude determination and their inherent accuracy are demonstrated. Opti- 
cal flash data can provide loci with a resolution of 0.1°. Solar sensor loci 
are resolved to within 1°. 

All techniques described have been consolidated into working computer 
programs which follow closely the mathematical analysis presented. A num- 
ber of important supporting calculations such as the solar position, sidereal 
time, orbit updating, etc. are also developed. Because of the complexities of 
the mean torque and gyroscopic equations, the precessional techniques 
presented are most useful in computer embodiments. 
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I. INTRODUCTION 


To maintain a defined attitude in space, the Telstar I and IT satellites 
were spin stabilized. By this method of passive attitude control, a 
satellite is rotated about an axis of symmetry and consequently exhibits 
the characteristics of a gyroscope. In the absence of disturbing torques, 
the satellite’s spin axis maintains its spatial orientation fixed with 
respect to an inertial reference frame throughout its orbit. For the 
Telstar I and II satellites, this is desirable because of certain required 
attitude constraints. First of all, the satellite communication antenna 
is not omnidirectional. More energy is radiated along the equator of 
the satellite than along its spin poles, as shown by the antenna pattern 
of Fig. 1. This fact dictates an attitude for which the line of sight from 
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Fig. 1 — Antenna pattern at 4170 me. 
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Fig. 2 — Skin temperature distribution vs polar angle. 


a ground station to the satellite avoids its poles. The second attitude 
constraint involves temperature considerations. A good degree of tem- 
perature control is obtained by orienting the spin axis so that it is 
nearly perpendicular to the satellite-sun line. In this manner, tempera- 
ture balance is maintained by the satellite’s spin as indicated in Fig. 2. 
Here, ¢ is the solar offset angle, defined as the angular departure of the 
spin axis from perpendicularity with the satellite-sun line. Solar offsets 
of about 15° result in temperature deviations of about 150° (see Fig. 2) 
and are tolerable. 
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An ideal orientation from a communications standpoint would be to 
have the spin axis nearly parallel to the earth’s surface as it passes over 
any ground station. A spin axis perpendicular to the orbital plane would 
accomplish this, but would produce a maximum axis tilt toward the sun 
equal to the sum of the orbital inclination and the earth’s 233° tilt with 
respect to the plane of the ecliptic (see Fig. 3). Under these conditions 
the spin axis of the first Telstar satellite would have a maximum solar 
tilt of 68°. This exceeds the 15° tilt limit dictated by temperature 
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Fig. 3 — Geometry of spin axis perpendicular to orbit plane. 


balance. On the other hand, if the spin axis is made perpendicular to the 
ecliptic plane (which automatically insures its perpendicularity to the 
sun), the axis tilt with respect to the line of sight from any ground 
station over which it may be passing will range from 90° to 90° minus 
the sum of the orbit inclination and ecliptic obliquity or 22° (see Fig. 
4).* From a communications standpoint this is tolerable, since nulls 
in the antenna pattern are major only within about 15° of the spin 

* With this orientation, stations south of the satellite’s instantaneous earth 
latitude would experience "angles less than 22°, such as station B of Fig. 4. The 
major ground stations for Telstar I and II, however, are all at latitudes above 


43°. Since the inclinations of the Telstar I and II satellites are 45° and 42.7° re- 
spectively, these stations are almost always north of the satellite. 
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Fig. 4— Geometry of spin axis perpendicular to ecliptic. 


poles (see Fig. 1). Rising and setting satellites may experience a greater 
tilt to the ground station line of sight, but this is unavoidable. 

Because of these considerations, the Telstar I and II satellites were 
launched with their spin axes as nearly perpendicular to the ecliptic as 
the powered flight trajectory of the Thor-Delta launch vehicle would 
permit and still meet certain orbital requirements, such as inclination, 
apogee height, and perigee height.1 The predicted attitudes of these 
satellites at orbit injection are calculated from telemetry data from the 
first and second rocket stages. Table I lists these attitudes as the right 
ascension and declination direction of the north pole of the satellite. 


TaBLeE [—AT?riITUDE OF THE TELSTAR I AND II SATELLITES 
AT INJECTION, DETERMINED FROM POWERED FLicHtT Data 





Initial Attitude 
Satellite 





Right Ascension Declination 


Telstar I 83.73° —66.80° 
Telstar IT 82.23° —57.31° 
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The north pole is defined here as the direction of advance of a right- 
handed screw turning in the direction of the satellite’s spin. It also 
happens to be the spin pole which carries a helical telemetry antenna. 

The initial attitudes of the Telstar I and II satellites are changed 
by gyroscopic precession. This motion of spin-stabilized satellites is 
chiefly produced by both magnetic and gravitational torques. The 
former is a result of interaction between residual and eddy-current- 
produced magnetic fields of the satellite and the earth’s magnetic field. 
The latter is produced by differential gravitational forces acting across 
the body of the satellite. In the present cases, the magnetic torques are 
several orders of magnitude in excess of the gravitational torques. 

It is necessary to predict precession to allow proper scheduling and 
planning for satellite use, to resolve certain attitude determination 
ambiguities, and to sensibly plan alteration of satellite attitude in a 
prescribed manner when needed. (See Section V for a description of this 
technique.) It is the purpose of this paper to outline the methods of 
attitude determination and correction, to develop the precessional 
theory, and to show the application of these to the Telstar I and II 
satellites. 


Il. ATTITUDE DETERMINATION —— GENERAL REMARKS 


The attitude of the Telstar I satellite has been determined through 
the analysis of two sets of data: the time of optical flashes of sunlight 
from three mirrors attached to the surface of Telstar and the current 
produced by six on-board solar sensors located on the ends of three 
orthogonal axes. 

The first of these sets of data in combination with the spatial position 
of the satellite, sun, and observer’s position determines a locus of possible 
spin axis positions which would result in the observed flash. This locus 
describes a cone in space whose axis is the mirror normal. The six solar 
sensors are designed to determine the direction of the satellite-sun line 
with respect to a satellite frame of reference and thus to the spin axis. 
Knowing this angle, again there is defined a conical array of possible 
spatial spin axis directions. The intersection of the optical cone with 
this solar sensor cone should, therefore, determine two possible attitudes. 
A priori knowledge of the approximate attitude as provided by launch 
data and/or a succession of measurements and predictions over an 
interval of several days permits the determination of a unique spin 
axis direction in space. 

In practice, the solar sensors present a few difficulties. First of all, 
the deduction of the sun’s position from solar cell current entails inferring 
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a solar angle between each cell’s normal and the sun’s direction. Such a 
direction from each of three cells uniquely establishes this solar aspect.* 
Thus calibration curves relating electrical output to light intensity must 
be employed for each cell along with temperature corrections. While 
D. W. Hill? has prepared a computer program to take the drudgery out 
of this work, it remains difficult to calculate the solar angle for a cell if 
it is illuminated in a direction far from its normal. Moreover, the solar 
sensor data, which are reported every minute by telemetry, were found 
to be not always mutually consistent. Often, the solar direction calcu- 
lated over 30 minutes from a succession of telemetry frames had a spread 
ranging from 1° to over 8°. A correlation between these deviations and 
the spatial position of the satellite exists which suggests a biasing of 
solar cell data, on occasion, by the earth’s reflectivity. 

To show the geometry of this situation, consider the satellite position 
shown in Fig. 5. Here one half of the satellite to the right of lme AB is 
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Fig. 5 — Satellite shadow geometry. 


illuminated by the sun. At least one half, to the upper left of line CD, 
is illuminated by the earth in reflecting about 38 per centf of the sun’s 
total incident light. Thus a solar cell in region BOD would record only 
direct sunlight, one in region AOC would record only earthshine, and 
* The solar aspect is the angle between the satellite’s spin axis and the satel- 
lite-sun line. 
t This is not the albedo but the earth’s mean reflectivity or ratio of mean earth 


brightness measured at a spot along the earth-sun line relative to the brightness 
of a perfectly diffusing disk of the same size and at the same distance that the 
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one in region AOD would record both. There would be no light from 
either the sun or the earth in region COB. By the difference in solar 
illumination and earthshine, data from cells in regions BOD and AOC 
are easily separated and the latter disregarded. However, those cells 
in region AOD measure both sunlight and earthshine in an inseparable 
manner and therefore report erroneously the sun’s position. If AOD is a 
large angle, the spin of the satellite will carry solar sensors into that 
region frequently and result in sizable variations in successive solar 
aspect determinations. Since AOD equals SET, as the subsatellite 
position (T) departs from the subsolar point (S), the solar cell data 
must be carefully interpreted to avoid false conclusions. Under these 
conditions, it would be wrong, for example, to simply calculate the mean 
of all solar aspect determinations over a succession of telemetry frames. 
This procedure, in general, would not yield a good estimate of the true 
aspect. A better technique would be to calculate the mean for only 
those solar aspect determinations in which all three cells entering into 
the determination exhibited current readings above those which could 
possibly be produced by either the earth’s reflectivity or low angles of 
solar illumination. Operating in this fashion, the true solar aspect can 
be determined to within about 1°. 

Since the maximum attitude resolution obtainable using optical 
. flashes from the Telstar satellites is about 0.1°, it was decided to rely 
on these for attitude determinations insofar as possible. Two groups 
of flashes close together in time are needed for an attitude fix, however, 
and in cases where only a single one existed, the attitude was determined 
by a combination of mirror flashes and the solar sensor data previously 
described. 

The optical reflections are characterized by a series of intermittent 
flashes provided by the spin of the Telstar satellites. The time midpoint 
of these flash series is determined by photoelectric equipment* at Bell 
Telephone Laboratories in Holmdel, N. J. The time of each flash series 
determines a conical locus of possible spin axis positions about the 
flashing mirror’s normal. The tip of the spin axis vector, therefore, 
lies on a circle on the celestial sphere. One of the intersections of two 
such circles defines the attitude, provided the two corresponding flash 
series occur close enough together in time so that no appreciable preces- 
sion occurs during the separation interval. Since the mirrors employed 
have their normals far removed from the spin axis (68° and 95°), it is 
earth is from the given spot. Quite naturally a phase law applies which reduces 
the light reflected to points off the earth-sun line. Off-line brightness varies 
crudely as the ratio of observable illuminated area of the earth as seen from the 


point in question to the total observable illuminated area seen from a point on 
the earth-sun line, 
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always possible to select the valid locus intersection from past per- 
formance and attitude predictions. 


II. ATTITUDE DETERMINATION — COMPUTATIONAL TECHNIQUE 


A computational technique has been developed for the calculation 
of spin loci. It consists of four basic parts, which are: 

(¢) the determination of the right ascension and declination of the 
normal of the flashing mirror; 

(zz) the determination of the right ascension and declination of the 
sun in the case of solar cell data; 

(iz) the construction of a circular locus on the celestial sphere cen- 
tered on the right ascension and declination of the mirror normal and 
having a radius of 68° or 95°, depending upon the mirror involved (in 
the case of solar aspect data from the sensors, the locus is centered on the 
right ascension and declination of the sun and has a radius equal to the 
measured solar aspect angle); 

(tv) the plotting, by computer microfilm techniques, of these circles 
and others determined from additional flash series and solar aspect data 
to ascertain intersections and corresponding satellite attitudes. 


3.1 The Right Ascension and Declination of the Mirror Normal 


Consider the generalized mirror orientation shown in Fig. 6. Here, a 
ray from the sun strikes the mirror and is reflected toward a particular 
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Fig. 6 — Geometry of mirror normal. 
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ground station. For this to be possible, the mirror normal must bisect 
the angle 2a and lie in the plane determined by the mirror-sun line and 
the mirror-ground station line. 

Sketched around the mirror in Fig. 6 is the celestial sphere. On this 
sphere the solar direction, as seen from the mirror, is indicated by its 
right ascension, a,, and its declination, 6,. In like fashion, the direc- 
tion of the satellite mirror as seen from the ground station is specified 
as a,6 and the direction of the mirror normal is a6, . 

To determine a, and 6, , we begin by solving for are g, and A in the 
spherical triangle 1, 2, 3: 


cos g; = Sin 6 sin 6, + cos 6 cos 6; cos (as — @) (1) 


eee ee sin 6, — sin 5 COS Js (2) 
cos 6 SIN gs 


where angles g, and A may have values from 0° to 180°. 
By triangle 1, 2, 4, one obtains the declination of the normal as 


5, = sin” [sin 6 cos g, + cos 6 sin g, cos A] (3) 


where 6, may have values between +90°. 
Using the same triangle we may write 


_.. -1 [sin (g, + @) sin | } 
C = sin [ an tee Fo) sin bs A (4) 
where 


C may have values between 0° and 180°. Since a, may either exceed a 
by C or be less by the same amount owing to the two possible orienta- 
tions of triangle 1, 2, 4, we have 


an = a+ SIGNE (Cya; — a) (6) 


where SIGNF, a common computer symbol, indicates that the algebraic 
sign to be affixed to C shall be determined by the quantity a, — a. 

Right ascension is measured eastward from the vernal equinox as a 
0 reference through 360°, and if a, and a should lie on opposite sides in 
this reference (C' remaining less than 180°), (6) becomes 


a = a + SIGNE (Cia — az). (7) 


Thus (6) isemployed if |a, — a| — 180° is negative and (7) if other- 
wise. 
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3.2 The Right Ascension and Declination of the Sun 


The apparent right ascension and declination of the sun at any 
specified time is computed from the mean orbital elements of the sun.‘ 
These may be expressed beginning with the true solar mean anomaly in 
degrees as 


M, = 358.47583 + 0.9856002670d’ — 0.0001507” — 0.0000037° 
where 


T = the time in Julian centuries of 36525 ephemeris days from 
January 0.5, 1900, ephemeris time? 
d’ = ephemeris days from same epoch. 


We may express the above equation in a more useful form for the present 
calculations by changing the epoch to January 1.0, 1960 and writing 
an equivalent expression as 


M, = 357.41283 + 0.985600267d 
where | 


d 


ephemeris days since the 1960 epoch 
mean solar days since 1960 epoch + 1 second for the years 
1963, 1964, 1965. 


One obtains the apparent mean solar anomaly, used in the present calcu- 
lations to determine the apparent position of the sun, by antedating for 
the solar light transit time. For this reason d is increased by 0.005375 
day, which is the light transit time at mean solar distance. Since the 
earth is about 3 X 10° miles closer to the sun in winter as compared to 
summer, this can produce an error in apparent solar position of about 2 
seconds of arc. 
The eccentricity of the earth’s orbit is 


e = 0.01675104 — 0.000041807 — 1.26 x 10/7" 
= 0.01700254 on January 1.0, 1960. 


Also, the degrees of mean celestial longitude of the perigee of the sun’s 
mean orbit about the earth as a reference is 


L = 279.69668 + 0.9856473354d’ + 0.0003037” 
282.25247 + 0.470684 x 10 “d. 


Finally, the mean obliquity of the ecliptic in degrees is 
e = 23.452294 — 0.01301257' — 1.64 x 10°°7" + 5.03 X 10°'7". 
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The apparent mean anomaly and eccentricity along with Kepler’s 
equation permit the calculations of the apparent true anomaly of the 
sun by standard techniques.® This along with the apparent mean longi- 
tude of solar perigee and mean obliquity of the ecliptic allows a direct 
determination of the apparent right ascension and declination of the 
sun at any date by the simple geometry shown in Fig. 7. 

The apparent right ascension and declination of the sun computed 
from its mean elements in this manner can depart from the tabular 
values (which include nutation) of The American Ephemeris and Nauti- 
cal Almanac for the years 1962, 1963, and 1964 by about 5 seconds of 
are as a maximum. This is well within the approximate 1° error of the 
measuring techniques for solar aspect as well as the 0.1° for the mirror 
technique. 


3.3 Construction of the Locus Circle 


As previously stated, the direction of the spin axis may be specified 
as a circle on the celestial sphere, centered on the right ascension and 
declination of the mirror normal. This locus may be generated by using 
the spherical triangle shown in Fig. 8. Here the dotted curve indicates 
the attitude locus of all possible positions of the spin axis. Let ajé; be 
any point on this circle making a fixed are F' with a, , 6, . In the case 
of either the Telstar I or II satellite, / will equal 68° or 95°, the angles 
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Fig. 7 — Geometry of the sun’s position. 
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Fig. 8 — The attitude locus. 


the mirror normals make with the spin axis. From Fig. 8, 
6; = sin” [sin 6, cos F + cos 6, sin F cos A] 


where 
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(8) 


A = a running variable which takes on values from 0 to 360° to 


generate the locus 


6, = declination of a locus point which ranges from —90° to +902, 





in general. 
Also 
a sin F sin A 
cos 61 
eae (cos F — sin 6,) sin 6; 
COS bn COS 67 
_; sin B 
B= tan’ 
cos B 
and 
a, = ar,t+ B. 


(9) 
(9a) 


(10) 


(10a) 


In general, B may range from 0° to 360°. All quadrant ambiguities 
presented by (10) are resolved by noting the algebraic signs of (9) and 


(9a). 


If there is a time uncertainty in measuring the midpoint of a flash 
series, this will result in more than one possible mirror normal, a,6, , 
and hence a number of attitude loci, since the satellite and sun will 
occupy successive positions along their paths within the time error. 
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If during a single sky trajectory a ground station records two separate 
flash series (or a single series plus solar aspect data), each will generate 
a circular locus. The intersections of these loci determine attitude with 
an ambiguity of two. That is, there are two intersections and therefore 
two possible satellite attitudes which satisfy the two flash series. A 
priori knowledge of the attitude from previous measurements or initially 
from launch parameters together with some knowledge of the expected 
precession will permit selecting the proper intersection. Knowledge of 
expected precession is also needed to determine whether or not the flash 
series are close enough together in time to neglect incremental precession 
during the time between the series. Such precession can alter the position 
of the loci intersections. 

Tig. 9 shows an example of the attitude determination of the Telstar 
II satellite for passes 135, 136, 199, 272, and 472. Solid lines indicate 
attitude loci determined from mirror flashes. The line of zeros is typical 
of those determined by solar sensor data. Two attitude determinations 
are shown by intersections on passes 135, 136, and on pass 272. 


IV. ATTITUDE PREDICTION 


Acting upon an orbiting spin-stabilized satellite to produce precession 
are certain disturbing torques. Those to be considered here are gravita- 
tional and magnetic torques. In the case of the Telstar I and II satellites, 
these are dominant over atmospheric drag torques, solar radiation 
torques, electrostatic torques, and others. 


4.1 Assumptions for Gravity Torque Calculations 


_ Differential gravity forces acting across the body of the satellite can 
produce torques which tend to rotate the body. These forces exist 
simply because the strength of the earth’s gravity field is a function of 
the distance from earth. A body of finite size must, therefore, experience 
such torques. 

For the purpose of calculating the mean gravity torques, the earth 
is assumed spherical with its radius equal to the equatorial radius. For 
the Telstar satellites, gravity torques are at least an order of magnitude 
less than magnetic torques, and therefore neglecting the earth’s oblate- 
ness produces at worst only a second-order error. 

It is also assumed that the moments of inertia of the satellites about 
all axes perpendicular to the spin axis and passing through the mass 
center are equal. The orbit is assumed elliptical and Keplerian, since 
the earth’s figure produces but second-order effects over a single satel- 
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for each succession of calculations will 


however, 


? 


lite period. The orbit 


rbital elements, which include all secular per- 


be updated using mean o 


turbations, 


to produce the Keplerian orbit of the best fit at the beginning 


of each calculation. 


4.2 Assumplions for Magnetic Torque Calculations 


In order to construct a mathematical model which on one hand rea- 
sonably well represents the physics of the situation, but on the other 


- 


hand does not, by its complexity, produce numerical equations costly 


to compute, the following simplifying assumptions are made: 
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(¢) The magnetic field of the earth is represented by a dipole centered 
at the earth’s center and inclined 6 degrees (equal 11.4°) to the earth’s 
spin axis.” 

(it) Mass symmetry about the satellite’s spin axis exists, as in Sec- 
tion 4.1. 

(iti) The effects of magnetic moments transverse to the spin axis are 
either negligible or average out due to the satellite’s spin. 

(iv) For the calculation of mean or net magnetic torques over an 
orbital period, a Keplerian orbit is assumed as in Section 4.1. 


4.3 Assumptions for Gyroscopic Equations of Motion 


It will be useful in simplifying the gyroscopic equations of motion 
(see Section 4.7) to assume that the satellite angular momentum vector 
coincides with the spin axis and equals the spin rate times the moment 
of inertia about the spin axis. This is the same as assuming the entire 
angular momentum of the satellite is the result of its spin alone, and 
neglects that small amount provided by the precession of the spin axis 
itself. Precessional dampers® are provided on the Telstar satellites to 
prevent coning or rapid changes in attitude at rates comparable to the 
spin rate. This, more than ever, makes the assumption quite reasonable. 


4.4 Coordinate Systems 


It will be convenient to establish certain useful coordinate systems 
and their interrelationships prior to the torque calculations. These will 
be defined and related by Euler-type axis rotations expressed by matrices. 
All coordinate systems are rectangular and right-handed in the con- 
ventional sense such that the rotation of an X axis into a Y axis de- 
termines the positive direction of a Z axis as the direction of progress of 
a right-handed screw. Each system will be named, described, and inter- 
related in that order. 


4.4.1 The Earth-Centered Inertial System (IS) 


In this system, the three mutually perpendicular X, Y, and Z axes 
have their common origin at the earth’s center. X contains the vernal 
equinox and increases positively from the earth’s center in its direction. 
Y is perpendicular to X in the earth’s equatorial plane. Z contains the 
earth’s spin axis and increases positively toward the north celestial 
pole. This is the basic system to which others will be referred. 
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4.4.2 Rigid-Body Systems 


Two rigid-body systems will be used. Both have their origin at the 
satellite mass center. The first of these is to be known as the SANOR 
or satellite nonrotating system. In this system, the z axis defines the 
satellite’s spin axis, being positive in the direction of advance of a right- 
hand screw spinning with the satellite. The x axis is in the satellite 





NN 
TO VERNAL *\. 7 
EQUINOX ee “ 


Fig. 10 — The IS, SANOR and SAR systems. 


equatorial plane, Y degrees rotated from X (see Fig. 10) about Z. This 
axis defines the zy-X Y plane intersection. The y axis is in the satellite 
equatorial plane orthogonal to x. At this point, we note that the rota- 
tional equations of motion for the satellite will be developed in this 
SANOR system. 

The SAR or satellite rotating system, as its name implies, differs from 
the SANOR system in that it rotates with the satellite. It has axes 
x’, y’, 2’, where 2’ coincides with z, and wz’ and y’ are defined as being 
rotated ¢ degrees from x and y respectively about z. 

To relate the SANOR and SAR systems to IS, we proceed as follows 
(refer to Fig. 10). The y rotation about Z yields the following matrix 


1674 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1964 


cosy siny O 
D=\|-siny cosy O}. (11) 
0 0 1 
The @ rotation about z yields 
1 0 0 
C= |0 cos 6 sin 6}. (12) 


0 -—sin 6 cos é@ 
The ¢ rotation about z gives 
cosg sing O 
B={-sing cos¢g O|. (13) 


0 0 1 


To transfer points in the IS to points in the SANOR system, we have 








x xX 
y| = (CD) | Y (14) 
| 2 Z 
or, just to shorten the notation 
x = (CD)X (15) 


where 
(CD) = multiplication of the C and D matrices. 
In like fashion 
a’ = (BCD)X (16) 
= AX. (17) 
where 
A = (BCD) = multiplication of the B, C, D matrices. 


Carrying out this multiplication, we may express A as the following 
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A 
cos Y cos cos ¢g sin y sin ¢ sin 0, 
— cos @sing siny + cos @cosy sing 
— sing cosy — sngsiny cos¢sin@|. (18) 
— cos @sin y cos¢g + cos 6 cos y cos ¢ 
sin 6 sin y — sin @ cosy cos 0 


The matrix CD may be obtained from A by letting ¢ = 0, 
cos sin y 0 
(CD) =|-—sinycos@ coswycosé@ sin@}. (19) 
sin y sin @ —coswsin@ cos @ 


Quite obviously the inverse operations apply by taking the transposes 
of the matrices. Indicating a transposed matrix by the symbol ~, we 
have 


X = (DC)x (20) 
and 


X = Az’. (21) 


4.4.3 Orbital Coordinate Systems 


Let an orbit defining system (ORDEF) be described along the lines 
shown in Fig. 11. Here 2, is the intersection of the orbit plane and the 
earth’s equator plane with + 2, drawn toward the ascending node of the 
orbit from the earth’s center. Axis z, is normal to the orbit plane, posi- 
tive in a direction a right-handed screw along oz, would advance if 
turned in the direction of the satellite’s orbital motion. Axis y, completes 
a right-handed system by being mutually perpendicular to x, and 2, . 

The satellite defining system (SADEF) has its z, axis collinear with 
z,, but x, passes through the instantaneous satellite position, w degrees 
from 2, . Axis x; therefore will be referred to as the local vertical of the 
satellite. 

The ORDEF and SADEF systems are related to the inertial system 
(IS) in a manner strictly analogous to the SANOR and the SAR mat- 
rices described in the previous section. The only change in the mat- 
rices is that 


1676 


THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1964 


Q replaces y 
2 replaces 6 


w replaces ¢. 


Thus we may write 


ty = (CD);:X 
X = (DC) 2, 
ts = AX 
X = Aw, 


where the subscript 7 indicates the Q, 2, w substitution. 


4.4.4 Magnetic Coordinate System 


(22) 
(23) 
(24) 
(25) 


A magnetic coordinate system (MAG) based on an inclined dipole 
model of the earth’s magnetic field is constructed as follows. Let axis 
Xm define the intersection of the geomagnetic equatorial plane with the 
earth’s geographical equatorial plane, positive toward the ascending 
node of the geomagnetic equator (7° from X, see Fig. 12). Axis zm is 
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Fig. 11 — The ORDEF and SADEF coordinate systems. 
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Fig. 12 — The geomagnetic coordinate system. 


normal to the geomagnetic equator plane, positive toward the north 
geographical hemisphere, and 6° from Z. Here 6 is simply the magnetic 
dipole inclination to the earth’s spin axis. Axis ym is in the geomagnetic 
equatorial plane orthogonal to wm . 

Transformations from the MAG system into the IS system proceed 
as in (15) thru (20) with 


n replacing y 
6 replacing 0 


in the matrices, so that 


(CD),.X (26) 


Lm 
and 
X = (DC),2n. (27) 


where the subscript 7 on the matrices indicates the above substitutions. 

Fig. 18 shows the orientation of the earth-centered inclined dipole 
which produces a field of best fit” to a field based on all observations 
of field vectors made anywhere over the earth’s surface. The anomalies, 
or differences between the actual and dipole field, decrease more rapidly 
with increasing heights above the earth’s surface than does the dipole 
field itself, making the fit better and better as altitude increases. 
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Fig. 13 — The earth-centered inclined dipole. 


Mean Gravity Torque 


If the potential energy of an orbiting satellite is expanded about its 
center of mass in a Taylor series and differentiated with respect to the 
angles giving its orientation with respect to the IS coordinate system, 
the instantaneous gravitational torque acting on the satellite may be 
expressed as” 


where 


I 


I= 
TS 


x 
I 


Ty = 8¢°(Is — I) (isk) (i, X &) 


w= GM/r’ 


universal gravitational constant X the mass of the earth 


geocentric distance of satellite 
moment of inertia about satellite’s spin axis, z 


(28) 


(29) 


moment of inertia transverse to z (assuming all transverse 


moments to be equal, i.e., J; = I,) 


unit vector along satellite spin axis, z (more generally the 
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vector along the axis of cylindrical mass symmetry, which is 
the spin axis by assumptions in Section 4.1). 


To determine the mean gravity torque, T, will be integrated over a 
nodal period (7) assuming no precession during that period; 1.e., the 
SANOR system stays fixed with respect to IS during the integration. 
Then 


fp? : : 
(T,)man = 7 f 8s — TG Ge X dt. (80) 
By interchanging the order of the dot product and taking k constant, 

y Tr oe, 

(Ty) ama = BSPED [ie f ie at} x a]. (31) 
Te V0 7 
If a pure Keplerian orbit’ is assumed, we have 

2 
a(1 — e’) (32) 





ro (reine cos. at) 


2.  2mra(i — e*)* 
r Q => 





Th 2) 
Ara’ 
GM = 7 (34) 
where 
r = radius vector from the focus of the elliptical orbit to the satellite 
a@ = semimajor axis of ellipse 
e = eccentricity of the ellipse 
P = argument of perigee 
w = argument of the satellite. 


Hence the integral becomes 


Tn ss w=2r 
[Sa-[ Sas (35) 





and 


62 (AT) 


(T,) mean — T.2(1 — &)! 


(36) 
Qn 
x je. | ii,(1 + e cos w cos P + e Sin w sin P) dw X | 
0 


Since in the ORDEF coordinates (see Fig. 11) 
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i; = cos wi, + sin wy (37) 
the dyad, i,i; becomes 
id, = cos’ w igiy + sin’ w jpjg + sin w Cos w(igjg + jig). (38) 


Substituting this into (36) results in three integrals. Those containing 
igi, and j,j, yield z as a result of the integration. The other integral is 
0. Therefore, 


67° (AT) 


(75) mean = Ta — @)! [k- (igig + join) X kl. (39) 

By some vector maneuvering (see Appendix A) this reduces to 
(Can = — et AD Utes) (ey XH (40) 
To transform the ORDEF vectors into the SANOR system, we write 
Xo = (CD) xX (22) 
X = (DC)x (20) 
“X= (CD) (DC)z. (41) 


Performing the indicated operations, after a bit of labor we find that 
k, = (sin Q sin7 cos ¥ — cos Q sin? sin y)i 
+ (sin Q sin7 sin y cos @ — cosQ sini cosy cos @ + cosi sin 6)j (42) 
+ (sin Q sinz sin y sin 6 + cos sin 7 cos y sin 6 + cos 7 cos @)k 


where, referring to Figs. 10 and 11, we see that 


y = an Euler rotational angle 
~ = orbital inclination 
Q = ascending node of orbit. 


Substituting into (40), the mean gravity torque reduces to 


6r AI as 
Tad a ee a See (Q — p)} 


-{(sin 7 cos @ cos (Q — yw) — cos7 sind)i+ (sinz sin (Q — y))j]. 


CT) toean = (43) 


4.6 The Mean Magnetic Torque 


It is well known” that the scalar potential (€,,) of a magnetic dipole 
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may be expressed as 

















1 p-i, 
P, = 
4p r 
1 pr 
= Amp pe (44) 
— Pp Knit 
Amo - (45) 


where 


Mo = permeability of free space 
4x X 10‘ webers/ampere-meter (Ref. 12) 
4x X 10‘ henry/meter (Ref. 12) 
p = magnetic moment of earth’s field, direction and magnitude 
10’ weber-meters 
= 10'7(10"°/47) emu = 8.06 X 10” emu 
= 8.06 X 10” erg/gauss (Ref. 12, p. 25) 
i, = unit vector along the x, axis, local satellite vertical 
r = distance from dipole center (earth’s center) to satellite 
k, = unit vector describing direction of geomagnetic moment 
p = magnitude of p 
r = vector form of r along 7, axis. 


I 


A satellite magnetic moment (M) in the earth’s magnetic field (H) 
will produce a torque 


T=MXH. (46) 


If the satellite spins rapidly, any magnetic moment components perpen- 
dicular to the spin axis will tend to produce torques which average to 0, 
while the component along the spin axis will produce a net torque of 


Tz = M, XH (47) 
= M.k X H ergs (48) 

where 
M, = satellite magnetic moment along its spin axis (weber-meters 


or ergs/gauss ) 
k = unit vector along satellite spin axis (SANOR system) 
H = geomagnetic field (ampere-turns/meters). 


It is desired to integrate the instantaneous torque, T,, over one 
anomalistic period in order to calculate the mean torque. To do this, 
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we shall express H in the MAG system and eventually convert this 
expression into SANOR terms to be comparable with /, . 
We begin by setting 





H = —V#®,, (49) 

from (45) and (49), 
w= fy | = nin Yd fate) (50) 
Lo r 
where 
Lm 5Ym,2m = components of rin MAG system 

and 

h = p/Arn. (51) 
Therefore, 

= §¥ (en/r*) (52) 

Ho 
and since 
P= (tm + Ye + em)? 

we have 


ne! [Bam2mim + 3Ym@mbm + (Bem? — 1”) Km. (53) 
Mor? 


Rewriting to spotlight r components, using the following relationships 
normalized to r, 


Lin} t =1:z, Ym/T = ly, Zm/T a he (54) 
one obtains 
H = — nS [3r21elm + Br yal + (3r7 = 1)k,,] (55) 
Mor 
or 
p . e 
= mo s° Bm ) 4s 56 
H Tena 3 (is: Km) is] (56) 
since 


| eae poe (57) 
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By the following equations [(24) through (65)], the variables i, and 
k,, will be expressed in forms useful in the ensuing mean magnetic torque 
calculations. 

Let us first express i, in the MAG system. As shown previously, 


Cp AX (24) 
and 

X = (DC) ,Xm (27) 
therefore 

t= Ad DG), Xa (58) 
or 
i, = | (cos Q cos w — cos7 sin w sin Q) 

-(cos w sin 2 + cos 7 cos 2 sin w) (sin w sin 7) | 
cos yn —sin y cos B sin 7 sin B | | im (59) 
xX | sin y cos n cos B —cos 7 sin B | | jm 

0 sing cos 8 | Kn | 

Expanding, 


i, = [cos n(cos 2 cos w — cos 72 sin w sin Q) + sin n(cos w sin 2 
+ cos7 cos Q sin w)]in + [—sin 7 cos B(cos 2 cos w 


— cos7sinwsinQ) + cos 7 cos B(cos w sin 2 


+ coszcosQ2sinw) + sin 8 sin w sin 2|jm + [sin 7 sin 8 oe) 
(cos 2 cos w — cos7 sin w sin 2) — cos 7 sin (cos w sin Q 
+ cos 7 cos 2 sin w) + cos 8 sin w sin 2|Ky « 
This will be used later. 
Also, i, in the IS system may be written [see (24)] as 
i; Ail (61) 
i, = [cos Q cos w — cos72 sin w sin QI 
+ [cos w sin Q + cos7 cos Q sin wlJ (62) 


+ [sin w sin 2|K. 
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In a similar manner, k,, in the IS system is expanded as follows: 


tm = (CD) ,X (63) 

or 
k,, = sin 7 sin BI — cos7sin BJ + cosBK (64) 
= sin B(sin 7 I — cosy J) + cosBK. (65) 


The problem is now to integrate the instantaneous torque, T, , over 
one anomalistic period, 7", , to obtain the mean magnetic torque, 
1 tytT 1 vy+2r di 


T,dt = — 


ie t; i My a gs 


(Tis) inein = 
1 Vgt2r r (66) 
= Tn h dv 


T. CP 
where 


T, = anomalistic period of the satellite 
T,, = instantaneous magnetic torque 

¢t = time 

v = true anomaly of the satellite 

r = geocentric satellite distance 
defined by (68) below. 


> 
ll 


Substituting into (66) from (48) and (56) 


Veter ~ o3 . 2 
(T;,) mean = eal k x / Hn = 3 Re) (5) a: (67) 





Atpol'a i r3 
Since 
_ Qra?(1 — e)? 13 
h= — (68) 
and 
ay, a(1 — e) 13 
~ 1+ ecosv (69) 
where 


a = semimajor axis of orbit 
= orbital eccentricity 
T, = anomalistic period of the satellite 


& 
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(67) becomes 


M,p 


(LA) mean — Sx2na8(1 — e)? 


vyt2r (70) 
KX i. iG eesti 


~t2r 


= ok x fT fe SG eS e comes a) 


First we shall evaluate the integral of (71) beginning by substituting 
the k,, value given in (65), to obtain 


qtr 


= = (sin B)k Xf [sin gl — cos nJ — 3{%,- (sin a 


— cos nJ) }i](1 + e cos v)dv (72) 
+ (cos B)k X i {K — 3(i,-K)i,}(1 + e cos v)dv 
=I1+I10 (73) 


Integral II, the simpler of the two, is evaluated in Appendix B. The 
result is given below: 


Il = 7 cos B[{ —2 sin 6 + 3 sin’ 7 sin 6 
+ 3 sin z cos 7 cos 6 cos (Q — p)}i (88) 
+ 3 sin7coszsin (Q — p)jj. 


Now we must evaluate the first integral, I, of (73) but first let us 
state that by examining (72) it is perfectly obvious that the I integral 
does not exist for a noninclined dipole. For this case, it follows from 
(73) with cos 8 = 1 that 


M.p 
33001 ei II. (89) 
By suitable variable substitution the first integral I of (73) may be 
compressed to yield 


I = (sin B)k X (IA — JB — 31-C + 3J-E) (90) 


CT) mean = 


where 


vyt2r 
A= i sin 7(1 + e cos v)dv (91) 


7 


1686 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1964 


vy; +2r 

B= | cos n(1 + e cos v) dv (92) 
Vyt29 

ome | Eines ce cana (93) 
vitor 

E = iis cos n(1 + e€ cos v) dv. (94) 


Now 7 must be expressed as a function of v in order to evaluate A, 
B, C, and E. For simplification of the integrals we shall also let the 
initial time for the integration be the time when the satellite passes 
through perigee. Then 


7 = ro +b wnt (95) 
= 0 + wel M /20 (96) 
= n + OM (97) 


where 


no = initial position of the ascending node of the geomagnctic 
equator at perigee passage with respect to the IS system (see 
Fig. 12) 

wy = angular velocity of the earth in the IS system 

¢t = time measured from passage of satellite through perigee 
mean anomaly of the satellite (radians) 
T, = anomalistic period of the satellite (time units) 
= (wz/2r)T, = number of turns of earth in time T,. 


5 
I 


Therefore 
sin 7 = sin mo cos (bM) + cos mo sin (bAT) (98) 
cos 7 = COS m cos (bM) — sin no sin (DM). (99) 
Unfortunately, M is related to v through Kepler’s equation as 
M=E-—esinE (100)" 
where 
i = the eccentric anomaly 
é = eccentricity of the orbit 
and 





ex 1—e\} v 4 v 
7 eon eee oy. 14 
E = 2tan ( i ‘) tan | tan (« tan *) (101) 
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Equations (100) and (101) certainly define M as a function of v, but 
in a most complicated manner. It can be shown, however, that a plot of 
v and AM versus time normalized to 7, will look like Fig. 14. So as a 
reasonable approximation to M7 we might consider 


M =v — (A/b) sin v (102) 
where 


\/b = maximum amplitude of the true anomaly “sine” wave of 
Fig. 14 = 2e.” 


Note that (102) resembles Kepler’s equation with v replacing /. With 
less sophistication we might even let 


M =». (103) 


The only justification here is that we shall be dealing in mean torques 
averaged over an orbital period, and the v function makes one oscilla- 


TY 
ee 
m/e 


M 
ANOMALY / 


360 






320 






280 











ANOMALY IN DEGREES 











Fig. 14 — Mean and true anomaly comparison. 
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tion about the M function in that time. If (103) is assumed, (98) and 
(99) remain unaltered except that v replaces M. If (102) is used, (98) 
and (99) become 


sin 7 = sin yolcos bv cos (A sin v) + sin bv sin (A sin v)] + 
cos 7o[sin bv cos (A sin v) — cos bv sin (A sin v)] (104) 
= sin no[A’] + cos 0[B’] (105). 
where A’ and B’ are defined by comparing (105) to (104). In like 
fashion, 
COS 1 = COS m[A’] — sin |B’). (106) 


We may express the sin (\ sin v) and cos (A sin v) portions of A’ 
and B’ as 


3. 3 
sin (A sinv) = Asinv — . aa (107) 
and 
: : »’ sin 7v 
cos (Asinv) = 1 — a ee (108) 


These approximations are reasonably valid for near earth satellites 
not having high eccentricities and avoid Bessel function complications. 
The neglect of higher-order terms in (107) and (108) for e = 0.25 and 
T, = 150 minutes results in errors of less than one part in 10°. 

As case I we shall evaluate the integrals A, B, C, and E using (98) 
and (99) with v replacing 17. For case II we shall return to evaluate 
these integrals again using (105), (106), (107), and (108). Case I 
details are given in Appendix C. Case II is outlined by Appendix D. 

Using the expanded II integral of (88) and the evaluated A, B, C, 
and E integrals from Appendices C and D, (73) may now be written as 


20 
Cl en = a eos Bk X / {k — 3(i,-K)i,}(1 + e cos v)dv 
0 


+ sin Bk x {AI = BJ — 3I- (ijigCy ae jadoC'e 


ie ; ; (130) 
= (i,j, - joig) Cs) aa oJ: (ii, L1 + jolo le 


+ (igjg a joi.) Es) | 


where 


ATTITUDE DETERMINATION AND PREDICTION 1689 


A, B, Ci, Ce, C3, £1, He, Es are all defined in Appendix C. 


The following, expressed in the SANOR system, will now be substituted 
into (1380) (refer to Fig. 11) 





k X I = cosyj + sin y cos 6i (131) 
k X (—J) = cosy cos 6i — sin yj (132) 
J-i, = sinQ (133) 
J-j, = cos 2 cos 2 (134) 
I-i, = cos Q (135) 
I-j, = —sin 2 cos7z (136) 
k X i, = —cos @sin (Q — W)i + cos (Q — w)j (137) 
k X j, = (—sinzsin 6 — cos 7 cos @ cos (Q — w))i 
— cost sin (Q — y)j ee, 
k X K = —sin fi. (139) 
These yield the mean magnetic torque, which is 
(Tar) mean = maa \ cos Bli{ —2 sin @ + 3 sin’ 7 sin @ 
+ 3 sin 7z cos 7 cos 6 cos (Q — p)} 
4 3{3 sin ¢ cos 7 sin (O — wilt 
ety ei A[sin y cos 6i + cos yj] 
+ Bleos y cos 6i — sin yj] (140) 


+ 3] {2, sin Q + E; cos 2 cos 7 — Ci cos Q 
+ C3 sin 2 cos 7}{—cos 6 sin (Q — p)i 
+ cos (Q — y)j} + {2 cos 2 cost + FE; sin 
+ Cy sin Q cos i — C3 cos Q}{— (sin 7 sin 6 
+ cosi cos 6 cos (Q — w))i — cosi sin (Q — vail 
or, collecting on 9% cos B and 9 sin 8, 
(Tac)mean = 431 cos | i{— 2sin @ +3 sin’ isin @ +3sinécosi (141) 
- cos 6 cos (Q — w)} + j{8 sinz cos7 sin (Q — yy | 
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+ 9 sin ali} sin y cos 6+ B cosy cos 6 
— 8[cos @sin (Q — w)(L,sinQ + EF cos Q cos 7 
— C,cosQ + C3 sin Q cos 7) + {sin 7 sin 6 
++ cos 7 cos 6 cos (Q — w)} (4, cosQ cost + 13 sin Q 
+ CysnQ cost — C3 cos a] -- j}A cos W 
— Bsin » + 8leos (Q — W)(£, sin Q 
+ EH; cos Q cos 7 — Cy cos Q + C3 sin Q cos 7) 
— cosz sin (Q — W)(y cos Q cos7 
+ 3 sinQ + C2 sin Q cos7 — C3 cos Q)] |. 


Using the magnetic moment program, it has been shown in the case 
of the Telstar satellites that letting JJ = v produces precessional results 
that follow the case II approximation over 1,000 orbits to within 0.01°. 
To document this result, the residual magnetic moment used was —0.9 
microweber-meter, spin rate equaled 20 to 10 radians per second, spin 
axis and transverse moments of inertia were 4 slug-feet’, orbit perigee 
was set to 4,500 miles, and eccentricity ranged from 0 to 0.95. Slight 
changes in the z-y torques of the order of thousandths of a microfoot 
pound were observed as the principal differences in the case I and II 
approximations for these eccentricity ranges. While these differences 
are negligible for the Telstar J and II satellites, other satellites in suffi- 
ciently lower orbits or having greater residual magnetic moments could 
require the case II approximation (no complete study has been made 
to date to bound the required ranges of the above mentioned variables 
for case I to achieve agreement to within 0.01° of case II). 


4.7 Equations of Motion of a Body Symmetrical about Its Spin Axis’® 


The mean gravity and magnetic torque equations have been derived 
in the previous section. To analyze the motion of a spin-stabilized 
satellite responding to these torques, certain gyroscopic equations must 
now be developed. We begin by relating the vector angular momentum, 
3c, for any rotating body to the external forces acting on the body as 


eR’ =N (142) 
where 


N = resultant moment of all external forces acting on the body 
sc’ = the time derivative of 3c. 
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Using the assumptions in 4.3, we write in the SANOR system 
5K = ok (143) 
where 
@ = the satellite spin rate. 


From this, it follows that (sce Fig. 10) 


se’ = I,[¢k + ¢(o X k)] (144) 
where 
w = angular velocity of the SANOR system referenced to IS co- 
ordinates. 
We may express w in the SANOR system as (see Fig. 10) 
O = wr + w,j + wk (145) 
or 
wo = 6i + ¥sin 6j + ¥ cos 6k (146) 
where 


We, Wy, Ww = angular velocity about the a, y, and z axes respectively. 
Combining (142), (144), and (146), after performing the indicated 
operations 


N = 13(gf sin 6i — ¢6j + ¢k). (146) 
Expressing (146) as 2, y, and z torques in the SANOR system, 
T, = Isof sin 6 (147) 
T, = —I3¢6 (148) 
T, = Is. (149) 


Quite obviously it is the x and y torques which produce precession. 
These torques, by the assumptions of Section IV, are the sums of the 
x and y components of the gravity and magnetic torques expressed by 
(43) and (141). T. is zero as a result of assuming zero magnetic moment 
transverse to the spin axis and mass symmetry about that axis. We 
note that a nonzero 7’, implies a change in the spin rate. For the Telstar 
I and II satellites this takes place principally because of induced eddy 
currents which produce transverse moments.” 


* The general equations of motion referred to the center of mass for a rigid 
body spinning about the z axis, and symmetrical about this axis, are 
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4.8 Alternate Inertial Coordinate System 


Clearly, (143) is valid only if the angular momentum vector, 5, 
coincides with the satellite spin axis k. We note in (147), which 
was derived from (143), that the above assumption will be approxi- 
mated only if ¥ sin 6 is small as compared to ¢. Otherwise the direction 
of 3¢ shall certainly be influenced by that term as well as ¢. We note 
too that a valid condition within the bounds of the assumption is for 
y sin 6 to be small even with large ¥ provided only that 6 itself be ap- 
propriately small. That is, solutions to the equation of motion exist 
for large ¥, and those will occur only for small 6 because of (143).* 
But if @ be near 0° or 180°, a singularity exists in (147), for ¥, as a 
result, approaches infinity. An exit from this dilemma may be secured 
simply by transforming to a new reference set of inertial coordinates in 
place of the IS system whenever @ becomes small. Naturally, the new 
set should be chosen so that the equivalent @ then existing will be 
large. This is accomplished by redefining IS so that 


X2 corresponds to Y 
Y. corresponds to Z 
Ze corresponds to X 


as shown in I’ig. 15. Let us call this new inertial frame the IS2 system. 
Transformation equations to relate IS to IS2 are quite simple and 
are given below: 


X_ = QX (150) 
X = QX, (151) 
T, = 16+ (3 — Dy? sin 6 cos 6 + Igy sin 0 
Ty, = Iv sin @ + (21 — Is)¥6 cos 6 — 1366 


T. = I3(f cos 0 — ¢6 sin 6 + %). 


For cases where ¢ dominates , 6 sin 6, and ¥ cos 6, these torques reduce to 
those given by (147), (148) and (149). We note principally that a change in spin 
rate reflects a 7, processional torque, from the above equations, but this is 
generally small and is neglected in this paper. For Telstar I, this torque compo- 
nent is estimated at least two orders of magnitude below the magnetic torques 
considered herein. (See also Ref. 17.) 

* This is the case for precession through or near the north or south celestial 
pole, where even small changes in attitude, or 6, can produce large changes in y. 
The situation is quite analogous to an azimuth-elevation antenna tracking a 
satellite that passes through or near the zenith, where the azimuth rates become 
extremely high even for small changes in satellite position. 
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Fig. 15 — Euler rotations in the alpha-gamma system. 


where 
01 0 
Q= 10 0 1 (152) 
10 0 
and 
7 00 1 
Q= |]1 0 O}. (153) 
0 1 0 


Relationships between the other coordinate systems and the IS2 system 
may be obtained using equations of Section 4.4 and applying the Q 
matrix as appropriate, except that in the expanded matrices 


a corresponds to y 
y corresponds to 6 


where 
a = The Euler angle measured from X2 to the intersection of the xy 
and X2Y»2 plane, which defines a new x axis called the 2 axis in 
a SANOR 2 system. (See Fig. 15.) 


1694 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1964 


y = The Euler angle measured from Z, to k in a plane perpendicular 

to the x2 axis. (Rotation is about the x2 axis.) 

Quite obviously, the steps leading to the mean gravity and mean 
magnetic torques may be retraced using the Q matrix and the new 
a,y Kuler angles. This would lead to torque expressions as functions 
of these desired angles. The same result may be obtained by transforming 
the final torques expressed in the y-@ system to expressions in the a-y 
system by using explicit relationships between these two systems. We 
shall choose this later route and the needed relationships will now be 
derived. 

We note in Fig. 15 that the k (satellite spin) vector remains in the 
same spatial position whether expressed in the IS or IS2 system. It 
seems reasonable then to proceed to relate a-y to y-6 by observing the 
projections of the k vector on the XY and X2¥Y2 planes, respectively. 

Referring to Fig. 16, we see that 






pirate (157) 
—cos @ 
cos y = sin @ sin y (158) 
siny = (1 — sin’ @sin’ py)’. (159) 
Yo OR Z 
KSI SING, KSINY SIN@ 









HSINY Cosa 


SSS Se ee ee X2 OR Y 


Z2 OR X 


Fig. 16 — Theta-psi and alpha-gamma geometric relationships. 


ATTITUDE DETERMINATION AND PREDICTION 1695 


Quite obviously, inverse relationships may be developed. These are 


cos Y 


tan y = ———___—— (160) 
—sin a sin y 

cos 6 = —cosasiny (161) 

sin @ = (1 — cos’ asin’ y)'. (162) 


4.9 Mean Gravity and Magnetic Torques — Alternate Euler Rotations 


Using (157) through (159) it can be shown that the mean gravity 
torque expressed in the a-y system is 


6r' Al 


Fai & (—sin Q sin 7 cos y 


(Ta) mean — 


+ cos Q sini sina sin y + cos7z cos a sin y) 
lix(sin Q sin 7 sin y + cos Q sin 7 sin a cos y ae) 
+ cos 7 cos a cos y) + je(cos 2 sin 7 cos a 
— cos? sin a)]. 
Likewise, the mean magnetic torque can be expressed as 
(Tn) mean = OI COS al ie} —2 cosacosy + 3 sin 7(sin 7 cos a cos y 
— sin 2 cos 7 sin y — cos 2 cos 7 Sin @ cos ”)t 
+ jo}2 sin a — 3 sin 7(sin 2 sin a 
+ cos 2 cos 7 cos a) t 
+ 9 sin 6| ie) —Asiny 
(164) 
— Bsinacosy + 3G,(sin 2 sin a cos y 
— cos Q2 siny) + 3G2(—sin 7 cos a cos y 
+ sin 2 cos 7 sin y + cos Q cos7 sin a sin y) 
+ jo} —B cos a + 3G; sin Q cos a 
+ 3G2(sin 7 sin a + cos 7 cos 2 cos a) t] 
where 
G, = Hy, sin Q + #3 cos Q:eos t — Cy cos Q + Cz sin Q cost (165) 
G. = Ey cos 2 cost + #3 sin Q + Cy sin 2 cos i — C3 cos Q. (166) 


I| 


All other terms have meanings described previously. 
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V. EXPERIMENTAL RESULTS -— GENERAL 


Using the methods described herein, the attitudes of the Telstar I 
and II satellites have been predicted, determined, and these two results 
compared. In the case of the Telstar I satellite, the maximum angular 
deviation between predicted and determined attitude is 0.90° with the 
special-case exception of pass 2154, which is covered in detail later.* 
The average angular deviation is 0.88°. The predicted attitude of the 
Telstar II satellite deviates from that determined by optical flashes and 
solar sensors by a maximum of 0.5° and an average of 0.09°. These 
results are tabulated in Tables IJ and III. Here the deviation in right 
ascension and declination is displayed by subtracting the predicted 
values from the determined values. The angular deviation between pre- 
dicted and determined attitude is also given as the celestial great circle 
are G. When this value is followed by N, this indicates that no attitude 
fix has been determined by loci intersections (see Section II), and so 
the shortest are distance between the corresponding attitude locus and 
the prediction is quoted. The T preceding an attitude determination indi- 
cates the selection of that point as a target toward which a magnetic 
moment program embodying the above analysis attempts to converge. 


5.1 Hxperimental Results — The Telstar I Satellite 


The Telstar I satellite entered orbit on July 10, 1962. Its attitude, 
calculated by combining nominal third-stage burnout parameters and 
stage 1 and 2 telemetry data, is given in Table I. Between launch and 
pass 16, there were a number of instances when the orientation coil was 
inadvertently energized by misinterpreted command signals sent to the 
satellite at extreme ranges and/or low elevations. This orientation coil 
(often referred to as a “torque coil’’) is located in the equatorial plane 
of the Telstar satellite just under the outer skin. Its purpose is to enable 
attitude adjustments by the production of a magnetic moment whenever 
current is sent through the coil. A fixed amount of current may be caused 
to flow in either direction through the coil so as to produce a magnetic 
moment, in addition to the residual moment, of --7.8540 microweber- 
meters. Fig. 17 diagrams the sense of the magnetic vectors which are 
called positive, indicates the resulting north and south magnetic poles 
of the satellite, and shows the corresponding positive direction of the 
current flow in the orientation coil. The relative magnetic directions and 
current flow are in accord with established standards. 


* There are apparent exceptions on passes 16, 72 and 1657, but these relate to 
antenna pattern techniques (see Section 5.1) which have an expected accuracy 
of +1° to + 2°. 
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aed ; ; i es 
pre etl Irvpe Attitude Prediction] et °’ Bredhet,) MAG 
Pass Number for MOM 
De- (MWM) 
RA DEC |term.) RA DEC RA DEC G 
Burnout 83.7 |—66.8 
16 84.2 |—65.8 | SM| 84.2 |—65.18} 0 0 0 —0.86328 
16 83.0 |—66.0 | A | 84.2 |—65.18 |—1.2 |—0.82] 1.4 
72 87.78] — 66.27 0.5N 
72 86.0 |—67 A —1.3 |—0.73) 1.2 
135-136 91.7 |—66.0 | M | 92.19}—66.36 | —.49} 0.36] 0.6 
199 single locus M | 96.49/—65.95 0.4N 
271 100 —65 A |100.80/—64.92 |—0.8 |—0.8 | 0.5 
272 single locus M |100.90|—64.90 0.6N 
272 S |100.90)—64.90 0.2N 
472 107 .50]—59.80 0.2N 
931 T100.5 |—49.84; M {100.74;—49.93 |—0.24; 0.09] 0.18 
—0.86328 
1051 single locus 97 .25|—50.21 0.9N |+6.9907 
—0.86328 
1069 98 .42|—49.70 0.6N |—8.7173 
1114 95.92)/—51.01 0.1N |—0.71103 
1430 92.62] —59.55 0.1N 
1567 T96.2 |—64.2 | M | 96.71/—64.21 |—0.51]—0.01] 0.28 
—0.71103 
1657 99.2 |—67.0 | A |102.81)/—66.64 |—3.61) 0.36] 1.7 7.14297 
—0.59101 
1695 78.58|—57.01 
1909 90.59)— 59.71 0.7N 
2154-55 T104.0 |—53.5 | M |104.71)—54.81 |—0.71)/—1.3 | 1.5 
2200 single locus 106.03] — 53.28 0.1N 
2264 107 .20}—50.74 0.4N 
2464 107.09)—43.65 ON 
2482 106 .88]— 48.04 ON 
2509 106.57) —42.33 Oo N 
2582-83 105.2 |—40.2 | M |105.44|—40.53 |—0.24| 0.38) 0.4 
3340, 41, 42) 106.3 |—43.7 | M |103.68/—61.54 |—2.62] 17.8 {18.0 
3476-7 112.30 |—46.32| M |120.21|/—66.21 7.91; 19.9 |22.0 
3495 123.16/—66.41 
3476-7 T112.30 |—46.32] M |112.30/—46.32t| 0 0 0 —0.40945 
M Mirror flash data. 


A 


Antenna pattern data. 
8 Solar sensor data. 
+t Connects 3340 to 3476. 


Data relating to the inadvertent uses of the orientation coil are insuffi- 
cient to reconstruct the detailed precessional motion of Telstar I from 
launch to pass 16. Cn pass 16, however, an attitude determination was 
made by reducing optical and solar sensor data. This along with an 
attitude fix on pass 135-136 made from mirror flash data indicated a 
magnetic moment of about —0.7 microweber-meter. This was later 
refined to — 0.86328 microweber-meter by causing the magnetic moment 
program to connect, by the precessional theory herein described, the 
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Attitude Determin- Attitude Deviation 
ation ‘Thne Prediction (Det.-Predict.) MAG 
Pass Number Dita MOM 
(MWM) 
RA DEC RA DEC ARA | ADEL G 
Burnout 82.23]—57.31 
62, 63, 63 87.75 |—55.08} M | 87.75)/—55.08} 0 0 0 —0.48375 
100 88.55}—55.05) 0 0 0 —0. 48375 
132, 138, 133 88.75 |—55.34| M | 89.23)—54.97/—0.48] —0.37,;—0.5 
281 92.17|—54.10 ON 
293, 293 92.3 |—54.05) M | 92.38)—53.99}—0.08/—0.06| 0.1 
324, 25 92.40 |—53.71}| M | 92.90/—53.71| 0 0 0 
3831 93.01] —53.66 0.08N 
490 94.95|—51.81 
496 T95.00 |—51.73} M | 95.00;—51.73 ON 
541 95.34|—51.10 ON 
573 95.49) 50.74 ON 
611 95 .63]— 50.20 0.138N 
643 95.71)—49.80 0.16N 
M = Mirror flash data. 
S = Solar sensor data. 


attitude on pass 16 to a point on the pass 931 attitude locus. The con- 
vergence is within 0.18°, as shown in Table II. Using this latter magnetic 
moment, the precessional motion from passes 16 to 1051 was established. 
This precessional history is given in Tig. 18, where both the predicted 
curve and the attitude loci data are displayed. 

When an optical flash series is recorded at the Holmdel Laboratories, 
the midpoint of that series can be determined with a time accuracy of less 
than --20 seconds when using visual observation through the on-site 
telescopes and within +10 seconds photoelectrically. The early data 
recorded on the Telstar I satellite did not indicate the expected accuracy 
at the midpoints, so in Fig. 18 only central loci are plotted. We must 
therefore think of these loci as having possible tolerances up to +10 
seconds, since photoelectric data were reduced. A tolerance of +10 
seconds can cause the maximum locus limit to be displaced as much as 
+0.7° on either side of the plotted central loci. Considering this, the pre- 
dicted attitude curve passes through every loci and every loci intersec- 
tion on Fig. 18 with the single exception of pass 1051. The prediction 
curve would miss the lower limit of this locus by 0.3°. 

Extended attitude predictions were made on October 1, 1962 (around 
pass 800) and covered the period through January 22 (pass 1800). 
These indicated that attitude adjustments should be initiated during 
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January, 1963. This was needed to prevent the solar offset from exceed- 
ing about 15° as required (see Section I). In order to check the attitude 
correction procedures, a test maneuver was initiated on pass 1051. It 
began by turning on the orientation coil, so as to produce a magnetic 
moment of +7.8540 microweber-meters to oppose the — 0.86328 residual 
moment and yield a resultant of +6.9907 microweber-meters. This was 
continued through pass 1058, whereupon the coil was turned off until 
pass 1069. On 1069 the coil was turned on in the opposite sense so as to 
produce a moment of —7.8540 microweber-meters, giving the satellite 
a net moment of —8.7173 microweber-meters. On pass 1075, the coil 
was turned off. All told, the coil was positive for about 18 hours and 
negative for about 16 hours, a fact which approximately nullified the 
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Fig. 18 — Telstar I precessional history — passes 16 through 1051. 


precessional effect of the coil as intended for this test. Details of this 
maneuver are shown in Fig. 19. 

Table II summarizes the agreement of loci to prediction during this 
period. Giving loci of unknown tolerances a -0.7° spread and using the 
known tolerance for pass 1114, the residuals shown in Table IV are 
produced. 

The magnetic moment program connected the attitude on pass 1075 
to that determined by mirror data on pass 1567 within 0.23°. In so 
doing, it calculated a residual magnetic moment of —0.71103 micro- 
weber-meter for that era. Using this calculated moment, predictions are 
extended to pass 1657 where, on January 7, 1963, the main orientation 
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Pass Number Deviation of Prediction from Optical Locus 
1051 0.3° 
1069 0° 


1114 0.1° 
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maneuver began. The region from pass 1075 to 1657 is shown in Fig. 20 
along with the attitude loci. Predictions fit the determinations through- 
out within about 0.2°. 

In addition to using mirror flash and solar sensor data to determine 
attitude, W. C. Jakes, Jr. and group at the Holmdel Laboratories 
deduced the attitude for a number of passes in this period by analyzing 
antenna pattern data.’® Some of these determinations are included in 
Table II. Since their expected accuracy is +1°, these data also serve 
to corroborate the predicted attitude. 

The orientation coil was energized for the major attitude maneuver 
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Fig. 20 — Telstar I precessional history — passes 1075 to 1657. 
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beginning on pass 1657, January 7, 1963, and ending on pass 1695, 
January 11, 1963. During this time the residual magnetic moment of 
— 0.71103 microweber-meter had superimposed upon it a 7.8540 micro- 
weber-meter moment as a result of the coil. The attitude of the Telstar 
I satellite was changed by over 20° during this maneuver. The attitude 
and curve for this period are given in Tig. 21. The slight ripples in these 
curves are due to the inclination of the earth’s dipole. Because of the 
small seale of this plot, these ripples are more noticeable than in the 
previous graphs. 

At this point, we might well insert a brief discussion of the planning 
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Fig. 21 — Telstar I precessional history — passes 1657 to 1695. 
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behind this attitude maneuver and why pass 1657 was chosen for its 
commencement. First of all, it can be shown that for any angular offset 
in the spin axis from a fixed, desired orientation there exists, for each 
instant in time, a determinable polarity of voltage for the equatorial 
coil which will tend to decrease the offset. To utilize this principle fully 
would require turning on the orientation coil on many occasions when 
the Telstar I satellite entered the Andover skies. While such a procedure 
would tend to decrease the angular offset generally in the most direct 
fashion, there are occasions in the lifetime of the Telstar satellite when 
leaving the coil on for a number of revolutions will produce faster cor- 
rections than pulsing the coil continually at Andover. This is because the 
instantaneous torque produced by the continuous magnetic moment of 
the coil in the earth’s magnetic field varies during an orbital period in 
such a way that, in general, the average value exceeds the torque obtain- 
able by energizing the coil only in the Andover skies. The optimum pro- 
cedure then is for a period of continuous operation and this was under- 
taken. Tig. 22 shows the results of turning on the coil for steady torqueing 
at various dates. The heavy spiral shows the predicted position of the 
spin axis in right ascension and declination if the coil is never energized 
(the trial maneuver is omitted here for simplicity). The diverging curves 
indicate the motion due to the torqueing coil. Examination of Fig. 22 
shows rapid precession as a result of the torque coil field, so that no mode 
of continuous coil operation will long permit the spin axis to point close 
to the south ecliptic pole. The solar offset also must ultimately increase 
with the enlarging ecliptic angle. It therefore follows that if a simplified 
mode of coil operation exists, it must begin with a relatively short period 
of coil operation. 

Two factors concerning the precessional motions are noted in Fig. 
22. They are: 

(z) All precessional motions involving the orientation coil lie outside 
the spiral of residual precession for many orbits, because the net mag- 
netic moment existing when the coil is actuated exceeds the residual 
magnetic moment. 

(iz) Torque coil precession for both positive and negative coil polarity 
begins incrementally at the spiral in opposite directions. 

We see that the torque coil should be used sparingly, because of the 
relatively rapid motion it produces. The problem that remains is to 
determine the times to turn on and turn off this coil and also to investi- 
gate whether or not future coil use is required. Because the spin rate of 
the satellite is decreasing, equal magnitudes of satellite magnetic mo- 
ments will cause greater and greater precessional motion as time passes. 
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Fig. 22 — Telstar I predicted attitude showing steady torqueing. 


It is to be expected that after the coil is actuated and then turned off 
a spiral of larger excursion than that occurring for passes 16 through 1655 
will result strictly because of the lower spin rate existing at the later 
time. There is no way to avoid this except by repeated coil pulsing. 
To avoid, or at least minimize, repeated coil usage, the procedure is 
to try to keep the maximum ecliptic angle as small as possible. If this 
value is expected to exceed the allowable solar offset, it is advantageous 
to time the operation so that the maximum ecliptic angle will occur at 
the time of minimum solar offset. This criterion governs both the selec- 
tion of the day for coil turn-on as well as the duration of the attitude 
maneuver. As may be guessed from Fig. 22 and from the precessional 
motion factors stated above, only short torqueing operations during the 
month of January would result in the next precessional spiral (after coil 
turn-off) returning anywhere near the initial spiral shown in Fig. 22. 
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To cause the maximum ecliptic angle and the minimum solar offset to 
occur together, orientation coil use had to begin in the period from 
January 1 to January 7, 1963. As shown above, the maneuver which 
did begin on January 7 placed the attitude at right ascension 78.6° and 
declination at 57.0° by pass 1695. 

The magnetic moment program, in connecting pass 1695 attitude to 
that determined by mirror data on passes 2154 and 2155, determined a 
residual magnetic moment of —0.59101 microweber-meter. This differs 
by about 0.1 microweber-meter from the residual moment going into the 
torqueing maneuver, but appears to be borne out by the closeness of 
fit to the mirror data from pass 1695 through pass 2583 (see Table II).* 
Trig. 23 plots this trajectory along with the mirror loci. The fit of attitude 
predictions to determinations is very good from passes 1695 through 
2583 (see Table IT). The largest deviation of 1.5° on passes 2154-2155 
may be somewhat misleading. Referring to Fig. 23, we see that the 
predicted attitude on pass 2154 is easily within 0.1° of the 2154 locus. 
The interaction of the 2154 and 2155 loci however, produces a common 
area, or attitude box, 1.5° from the pass 2154 prediction. This box is 
produced by a somewhat grazing intersection of the loci, and its location 
is therefore affected by the precession which took place between these 
two passes. Antedating the 2155 locus to account for this would place 
the attitude box within 0.05° of the 2154 and 2155 prediction, so this 
fit is quite valid. 

We are not in so comfortable a position for passes 3340 and there- 
after. Attitude boxes for 3340-1 and 3476-7 are shown as dashed lines 
on Fig. 23. They are about 20° from the corresponding predicted atti- 
tudes, and the reason for this discrepancy is at present unknown. We 
note that the Telstar I satellite ceased its transmission on pass 2065 
(February 20, 1963) because of radiation damage, but that the attitude 
remained predictable at least through pass 2583. The region from pass 
2583 to 3340 is devoid of attitude data because no telemetry could be 
received to report solar aspect and no mirror flashes were recorded. This 
was due to the increased activity on Telstar II, a certain proportion of 
Telstar I passes occurring in daylight hours, and prevailing weather 
conditions at Holmdel. 

Interestingly enough, it is noted that the attitude determinations on 
passes 3340-1 and 3476-7 can be connected by the magnetic moment 
program even though they cannot be sensibly joined to the preceding 


* It is possible that coil usage can alter the residual moment of the satellite, 
which was made small in the first place by appropriately balancing much larger 
magnetic fields.' 
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Fig. 23 — Telstar I precessional history — passes 1695 to 3500. 


data. Fig. 24 shows this union. The magnetic moment of —0.59101 
microweber-meter used in these last three plots cannot be considered too 
reliable, since it is one of a number which can connect the two attitude 
boxes shown in Fig. 24. However, it is the same as previously used in the 
1695 to 2583 region and may tend to be valid because of this. In any 
event, more attitude data are required to resolve this issue. 

As to what may have caused the anomalous behavior of Telstar I 
somewhere between pass 2583 and 3440, we offer the following, taken 
singularly or in combination, as possibilities: 

(i) the orientation coil has been energized during this period (from 
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(it) the residual magnetic moment has changed a multiplicity of 


times; 
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SPIN AXIS RIGHT ASCENSION.IN DEGREES 
Fig. 24 — Telstar I precessional history and prediction — passes 3840 to 5000. 


the present precessional motion, it appears to have been turned off 
(cit) meteoric collision has taken place; 


between 3340 and 3477); 


dina 


(wv) pressure leakage from instrument canister has occurre 


manner to alter the attitude by reaction forces. 


-stage burnout parameters is shown 


5.2 Experimental Results — The Telstar II Satellite 
The Telstar II satellite entered orbit on May 7, 1963. Its initial 


attitude calculated from the third 
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in Table III. The predicted burnout attitude proved inaccurate because 
the third stage of the Thor-Delta vehicle did not perform nominally. 
This is evidenced by the fact that the orbital period calculated from 
such assumed nominal performance differed from the actual period by 
as much as four minutes. 

The first attitude fix of the Telstar II satellite occurred on passes 62 
and 63, when a total of three flash series were observed. Connecting 
this determination to that of pass 496 requires a residual magnetic 
moment of —0.48375 microweber-meter. The fit through pass 643 is 
given in Table III, and some typical attitude loci are plotted along with 
the attitude prediction in Fig. 25. Corresponding ecliptic angle, solar 
aspect, X and Y torques appear in Figs. 26 through 29. 
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Fig. 25 — Telstar II precessional history and prediction — passes 62 to 3000. (See foot- 
note, p. 1718.) 
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Fig. 26 — Telstar IT ecliptic angle — passes 62 to 3000. 


The solar offset is due to exceed the 15° limit dictated by temperature 
balance considerations (see Section I) by pass 1250 (see Fig. 27). Since 
the maximum offset which occurs on pass 1400 is only 18°, no plans are 
contemplated to reorient the Telstar II satellite strictly to prevent this 
mild excursion. 

There is, however, another excursion beyond the 15° limit, around 
pass 2670. Besides this excursion being more serious than the former, 
some interesting differences between these two events exist, as Fig. 
25 illustrates. 

On pass 1400, the attitude will be returning to a region nearer the 
south ecliptic pole (located at right ascension 90°, declination 67.5°) 


ATTITUDE DETERMINATION AND PREDICTION 1711 


PUTTETTEETTTESTETTTE TPN TEESE PET 


ST AS TEL 


| 
MA 


aa 

aera 
= 

—-—— 

== 





200. 400. 9. 800. 1000. 1200. 1490. 1800. 1800. 2000. 2200. 2400. 2600. 2800. 
PERIGEE NUMBER 


TELSTAR If ATTITUCE + BELL TELEPHONE LABORATORIES - LC THCMAS 
MAGNETIC MOMENT=-.48375 M-WeM, OT13,INITIAL ATT FM FLASHES PASS 62,63 


Fig. 27 — Telstar II solar aspect — passes 62 to 3000. 


thereby limiting the solar offset to values below 15°. On pass 2670, 
however, the attitude is moving away from the south ecliptic pole, and, 
in fact, because of its now lower spin rate is entering a spiral of greater 
excursion than that previously traversed. It therefore behooves us to 
correct the attitude before that latter spiral occurs. 

The question remaining is to determine the most profitable time for 
such a correction. Ideally, a well-chosen time would meet the following 
conditions: 

(2) It would take place just after a good optical attitude fix was 
established to verify the predictions. 
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Fig. 28 — Telstar II X torques — passes 62 to 3000. 


(ii) The attitude maneuver would result in as many mirror flashes 
as possible during correction to indicate its progress. 

(iii) The attitude maneuver would end with the satellite in a posi- 
tion to guarantee a suitable attitude for as long a time in the future as 
is possible. 

At the time of this writing, suitable attitude corrections are under 
study. Most probably the orientation coil will be energized sometime 
between passes 1300 to 1500 in a negative sense or in a positive sense 
near pass 2400 so as to drive the attitude downward and to the left in 
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Fig. 29 — Telstar II Y torques — passes 62 to 3000. 


the sense shown in Fig. 25. This permits the next attitude spiral to occur 
near the south ecliptic pole.* 


VI. CONCLUSIONS 


Techniques for both attitude determination and prediction for spin- 
stabilized satellites have been developed. Their use has been demon- 
* The attitude of the Telstar IT satellite was successfully reoriented by energiz- 


ing the coil on pass 2402 (May 17, 1964) and leaving it in that state until pass 2421. 
Thus attitude predictions given in Fig. 19 beyond pass 2402 are no longer valid. 
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strated using Telstar I and II satellite data. It has been shown that an 
inclined dipole model of the earth’s magnetic field and the method of 
averaging the gravitational and magnetic torques over each anomalistic 
period of the satellite permit attitude predictions to within a few tenths 
of a degree of determined values in most instances. In those few cases 
where departures are above one degree, explanations have been pre- 
sented to show the reason for such discrepancies. The reasons are (1) 
unknown time errors in determining the midpoint of the optical flash 
series and (2) grazing intersections of the attitude determining loci. 
There remains but one anomaly in the precessional motion of the Telstar 
I satellite which, for the moment, is unexplained. Possible reasons for 
this anomaly are given in Section 5.1. 

It has further been shown that the seemingly crude approximation of 
letting the mean anomaly of the satellite equal the true anomaly for the 
purposes of determining the mean torques produces attitudes in close 
agreement (0.01° over 1000 orbits) with more sophisticated approxima- 
tions for orbit eccentricities up to 0.9 and perigee radii above about 
4500 miles. (See Section 4.6, case IT.) 

The comparisons made herein of the precession given by the magnetic 
moment program and the attitude determinations substantiate the 
simplifying assumptions made in Sections 4.1, 4.2, and 4.8. These 
assumptions are most instrumental in producing a working technique 
for both attitude prediction and residual magnetic moment determina- 
tion which is amenable to analytic solution and conservative of com- 
puter time. 

Furthermore, the usefulness of combining optical flash and solar 
sensor data for attitude determination and their inherent accuracy is 
shown. Optical flash data can provide loci with a resolution of 0.1°. 
Solar sensor loci are resolved to within 1°. While it is clearly straight- 
forward to determine analytically the boundaries of the attitude boxes 
from intersecting loci exhibiting estimated time tolerances, this paper 
indicates the decided advantage of graphing the individual loci to 
determine the angle of intersection and thereby gain an estimate of the 
validity of the boundaries in the presence of precession (see, for ex- 
ample, Section 4.1). 

Finally, the techniques described have all been consolidated into 
working computer programs which follow closely the analysis presented. 
In addition, a number of important supporting calculations such as the 
solar position, sidereal time, orbit updating, etc. are developed. Because 
of the complexities of the mean torque and gyroscopic equations, the 
precessional analysis is most useful when embodied in suitable computer 
programs. 
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APPENDIX A 


Derivation of Equation (40)"" 


Beginning with (39) of the main body of the paper, it is certainly 
evident that 


K(isty + Jojo) X K = (K-i,) (ig X K) + (K+5y) Go X Kk). (39a) 
It will be shown that the following identity exists 
(K-i,) (ig X k) + (K-jy) (jg X kK) + (k-k,) (kK, X k) = 0. (39b) 
Rewriting (39b), one obtains 


[(k-i,)i, + (K-j,)jo + (k-k,)k,] X k = 0 (39¢) 
and 
(k-ig) = (agig + Yoio + 20K a) -ig (39d) 
= 2g (39e) 
where 


Lg, Yo, 2% = components of k along the ORDEF axes. 
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In like fashion 
(K-jo) = Yo 
(k-k,) = 2. 
Substituting (39e,f,g) into (39c), we obtain 


(Xgig + Yolo + 2k) Xk =O 
kx k=0 
0= 0. 


Therefore 
—(k-k,)(k, X k) = k- (ijiy + joj) X k, 
and (40) follows from (39). 


APPENDIX B 


Evaluation of the II Integral of Equation (73)" 


(39f) 
(39g) 


(39h) 
(391) 


(89)) 


(39k) 


Copying the II integral from (73) of the main body of the paper, we 


have 
Qr 
II = (cos B)k x/ {K — 3(i,-K)i,} (1 + e cos v) dv. 
0 
Split (74) into two parts by letting 


Qa 
IIA = cos 6k xk | (1 + cos v) dv 
0 


Qn 
cos BK X I (1 + ecoswcosP + esinw sin P) dw 
0 


(27 cos B)k X K 


and 
Qqr 
IIB = (—3 cos 8)k X K: | ii,(1 + e cos w cos P 
0 


+ esinwsin P) dw. 
But 


i, = cos wi, + sin wj, 


(74) 


(75) 
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therefore 
id, = cos’ wisi, + sin” wj,jg + sin w cos w(igjg + jig). (78) 
It follows that 
JIB = (—82 cos6)k X K- (igig + jgjz)- (79) 
So that, combining ITA and IIB 
II = (wcos8)k X [2K — 8K: (igig + j,5a)] (80) 
= (rcos8)k X [2K — 3(K-j,) jo] (81) 
= rcos6[2k X K — 3(K-j,) (KXj,)]. (82) 


We shall now proceed to express (82) in the SANOR system. From 
Fig. 10, 


K = sin 6j + cos 6k. (83) 
From (22) and (20) we write 
ty = (CD),(DC)x (84) 


or 
jo = (sin w cos 2 cos 7 — sin Q cos y cos 2)i 
+ (sin 7 sin @ + sin © cos 7 sin w cos 0 
-+- cos 2 cos 7 cos W cos 6)j (85) 
+ (sin 7 cos 6 — sin Q cos7 sin y sin 6 
— cos 2 cos 7 cos y sin 6)k 
or, simplifying 
jo = [— coszsin (Q — y)]i 
+ [sin z sin 6 + cos 7 cos 6 cos (Q — W)]j (86) 
+ [sin 7 cos 6 — cosz sin 6 cos (Q — w)]k. 


Substitute (83) and (86) into (82) to obtain 
II = wr cos al (-2 sin 6)i — 3[{sin 7 sin’ 6 
+ cos i sin 6 cos 6 cos (2 — Ww) + sin z cos’ 6 


— cos 7 sin 6 cos @ cos (Q — w)}{(—sin 7 sin 6 (87) 


— cos 7 cos 6 cos (Q — w))i 


+ (—cosisin (Q — v))i1]. 
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Finally, 

Il = 7 cos B[{ —2 sin 6 + 3 sin’ 7 sin 6 
+ 3 sin 7 cos 7 cos 6 cos (Q — w)}i (88) 
+ 3 sin7cos7zsin (Q — w)jj. 


APPENDIX C 


Case I Expansion 


Refer to integrals of (91), (92), (93), and (94) of the main body of 
the paper: 


Qn 2a Qa 
A =| sin yo(1 + e cos v) dv = [ sin yo dv + e | sin yo cos v dv 
0 “0 0 
i re (109) 
= sin no | cos (bv) dv + cos no / sin (bv) dv 
0 0 
Pig 
+e sin m | cos (bv) cos v dv 
0 
fe (110) 
+ cos n [ sin (bv) cos v av] 
0 
= sin 40(D: + eDs3) + cos m(D2 + eDs) (111) 
1 eb : P 
= (} = a [sin mo sin 2rb + cos mo(1 — cos 2b) | (112) 
where the D factors are listed in Appendix E. In similar manner, 
B = cos no( Dy + eDs) — sin no( De + eD,) (113) 


= E - ros [cos no sin 2rb — sin no(1 — cos 2rb)]. (114) 


The C integral is expanded as follows 
20 
C= i ii, sin n(1 + e cos v) dv (115) 
0 


but 


i, = cos wi, + sin oj, . (116) 
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Therefore 
iggy = cos” wii, + sin” wj,j, + sin w cos w(igjg + joi). (117) 
Let 
C = igi + joigCe + Cigio + ini) Cs (118) 


and evaluate C; , Co , C3 separately. 
2a 
Cy = I cos w sin 7(1 + e cosv) dv. (119) 
0 


Letting w = v + P, expanding cos’ and collecting terms, we have 
Ci = sin nolcos’ P(Ds + eD;) + sin? P(Dy + eDu) 
— 2sin P cos P(Dy3 + eDsy)| 
+ cos mo[cos’ P(Ds + eDs) + sin? P(Dio + eDiz) 
— 2sin P cos P(Diu + eDyjg)]. 


(120) 


in like fashion 
Cz. = sin qolsin® P(Ds + eD7) + cos’ P(Dy + eDu) 
+ 2sin P cos P(Di3 + eDs)] 
+ cos no[sin? P(Ds + eDs) + cos’ P(Dip + cD) 
+ 2 sin P cos P(Dy + eDi5)] 


(121) 


and 


Qa 
C3; = | sin w cos w sin n(1 + e cos v) dv 
Cc 


sin nol (cos’ P — sin” P) (Dis + eDs) 
+ sin P cos P (Ds — Dy + eDz — eDu)} 
+ cos no[(cos’ P — sin? P) (Di + eDig) 
+ sin P cos P (Dg — Diy + eDs — eDiz)|. 


(122) 


Since 
2r 
E = / iis cos n(1 + e€ cos v) dv (94) 
0 


we expand as with C, to yield 
E = igighhy + jojo + (dodo + joig) Ls (123) 
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so that 


Qr 
E, = / cos’ w cos n(1 + e cos v) dv 
0 


cos nol[cos’ P(Ds + eDz) + sin? P(Dy + eDu) 
— 2 sin P cos P(Dis + eDis)] 
— gin nolcos’ P(Ds + eDs) + sin’ P(Diyo + eD) 
— 2sin P cos P(Du + eDie)] 


Qr 
ie | sin’ w cos n(1 + e cos v) dv 
0 


cos nolsin’ P(Ds + eD7) + cos” P(Dg + eD) 
+ 2 sin P cos P(Dig + eDjs)] 


I 


— gin mo[sin” P(Dg + eDs) + cos’ P(Diy + eDy) 
+ 2 sin P cos P(Dus + eD 5) | 


and 
27 

Ey = / sin w cos w cos y(1 + e cos v) dv 
0 

cos mol(cos’ P — sin’ P)(Dis + eDss) 


+ sin P cos P(Ds — Dy + eD; — eD1)] 
— sin no[ (cos P — sin’ P)(Du + eD1) 


+ sin P cos P(Dsg — Dio + eDg — eDiy)]. 
APPENDIX D 


Case II Expansion 


(124) 


(125) 


(127) 


(128) 


(129) 


We will now evaluate (91), (92), (93) and (94) once again in a manner 


similar to Appendix C, but this time using the approximation 


M = 0 — (X/b) sin v. 


(102) 
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The same notation will be used for the integrals 


29 
4 = / sin 7(1 + e cos v) dv 
0 


2x 
= sin no / {cos bv cos (A sin v) + sin bv sin (A sin v)} 
0 


(143) 
-(1 + e cos v) dv 
2a 
+ GOs xo | {sin bv cos (A sin v) — cos bv sin (A sin v)} 
0 
-(1 + ecosv) dv. 
Using the approximations of (107) and (108), we have 
A = sin qolDi + eDs + (Dia + eDus) — (d7/2)(Do + eDu) 
— (°/6) (Dis + eDis)] 
‘ (144) 
+ COs no[D» -}- eD, — (Dee “f+ eD33) = (X /2) 
(Dy + eDy) + (d°/6) (Dar + eDoe)]. 
In like manner, 
29 
B= [ cos n(1 + e cos v) du (145) 
0 
= cos mo[Di + eDs + (Du + eDu) — (X°/2)(Do + eDu) 
= (X"/6) (Dis “1 eDy9) | 
; (146) 
— sim no[|De + eD, = A(Doo + eDy3) = (X /2) (Dy 
+ eDy) + (1°/6) (Da + eDse)]. 
Similarly, 
Qr 
C1 = / cos’ w sin n(1 + e cos v) dv (147) 
0 


Ci = sin no| cos" P{ (Ds + eD;) + (Dig + eDog) — (X°/2) (Des 
+ eDss) — (d°/6)(Dar + eDss)} + sin’ P{(Ds + eDu) 


+ A(Dig + eDig) — (X?/2) (Dao + eDsg) — (r°/6) 
-(Ds5 + eDu)} — 2 sin P cos P{ (Dy -- eD5) + A(Dis 
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a eDes) = (X"/2) (Dos + eD32) = (0/6) (Dsz “fe cD») | 
(148) 
+ cos nol cos’ P{ (De + eDs) = (Dis + eDrs) = (\?/2) 


(Dey + eDeor) + (°/6)(Dxe + eDu)} + sin” P{ (Dro 
+ eDw) — MDa + eDx) — (d/2)(Deoy + eDsz) 
+ (A°/6)(Dss + eDaz)} — 2 sin P cos P{(Du + eDrs) 
= Du + eDss) — (d?/2)(Diy + cD) + (d°/6) (Das 
+ eDw)} |. 


To simplify the writing of the mean magnetic torque equation, let 
Fy = (Ds + eDz) + (Dig + eDos) — (d°/2)(Dos + eDo5) 


3 (122) 
— (N'/6) (Dai + eDs;) 

Fo = (Do + eDu) + A(Dig + eDyy) — (r’/2) (Do + eDss) (123) 
— (d°/6)(Dss + eDar) 

Fs = (Dis + eDis) + (Diz + eDos) — (X°/2)( Doo + eDse) (124) 
= (X°/6) (Dis + eD39) 

Fy = (De + eDs) — ACDis + eDog) — (N°/2) (Dos + eDe7) (195) 
+ (0°/6) (Daz + eDss) 

Pry = (Dro + eD,») = MDa aia eDe.) = (X’/2) (Dog sig eD37) (126) 
+ (4°/6) (Dis + eDs) 

Pe = (Dis a i eDig) a7 MDu i eDe3) = (X’/2) (Dis + eDs1) 
(127) 


+ (r°/6) (Dig + eDao). 


By the similarity of C, and C3 integrals to Ci and of #1, M2, Hs to Ci, 
C,, C3 we immediately write and summarize the following: 


C, = sin mo[F1 cos’ P + F. sin’ P — 2F; sin P cos P| 


+ cos nolFs cos’ P + F; sin? P — 2F, sin P cos P| ee 
Cy = sin nlf isin” P + F. cos’ P + 2F3 sin P cos P| 

+ cos [4 sin” P + F; cos’ P + 2F¢ sin P cos P| oe) 
Cs; = sin m[(F1 — F2) sin P cos P + F3(cos’ P — sin’ P)] (130) 


+ cos no[(F4 — Fs) sin P cos P + Fe(cos’ P — sin’ P)| 
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E, = cos mlF1 cos’ P + F, sin? P — 2F; sin P cos P| 


— sin noll’s cos’ P + F; sin” P — 295 sin P cos P) ey 
LH, = cos nolF 1 sin? P + F, cos’ P + 2F3 sin P cos P| 

— sin nol; sin’ P + F; cos’ P + 2F5 sin P cos P| er 
Es = cos nol(F1 — Fe) sin P cos P + F3(cos’ P — sin” P)] (133) 


— sin no[(l'4 — Fs) sin P cos P + F(cos’ P: — sin’ P)). 
APPENDIX E 
The D Integrals” 


27 
Di = cos (bv) dv = * sin (2b) 
0 


la 1 
D,. = I sin (bv) dv = = [1 — cos (27b)] 


b 
Q29r . 
D; = cos (bv) cos v dv = _) sin (2xb) 
0 1-8 
2a b 
Ds = i sin (bv) cos v dv = —-———-- [1 — cos (27b)] 
0 1 — BP 
or . 9 2. b? 
Ds = I cos (bv) cos v dv = baa sin (27rb) 
Qr ) b? 
— 1 2 — ~~ a= 
Ds = I sin (bv) cos’ v dv = ba = By [1 — cos (27b)] 
Qa 2 
Z 3 Brats b(7 — b) : 
Dy, = i COs (bv) cos vdv = (i — 8)(9 — BY) sin (2b) 
b(7 — B°) 


Qr 
Ds; = sin (bv) cos’ v dv = — [1 — cos (2b) ] 
0 


25 . 
fs ie _ 2sin (27rd) 
Dag = I cos (bv) sin’ v dv = (4 — &) 
2a 5 9 
ee as / a 
10 ; sin (bv) sin’ v dv ia — B) [1 — cos (2rb)] 
Dy = : sin’ v cos v cos (bv) dv = — 2b sin (2xb) 


0 (1 — b?)(9 — 6?) 
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2a 
= ] 2) . _ —22b _ 
Dy = | sin’ v cos v sin (bv) dv = (1 — &)(9 — 8) [1 — cos (27b)] 
2a 
Dy = / sin v cos v cos (bv) dv = 1 — cos (2rb) 
0 =o 
Qa 
Dy = | sin v cos v sin (bv) dv = _ sin (2b) 
0 a7 
(3 — b*) 


27 
Dy = sin v cos’ v cos (bv) dv = [1 — cos (27b)] 
0 


(9 — b?)(1 — 6?) 
(3 — b’) sin (27b) 


Qa 
. 2 . 
Dig = i sin v cos v sin (bv) dv = ~ 9— B)(1 — B) 





Qa i 
sin (2rb) 
Dy = sin v sin (bv) dv = pie ace 
2a i 
_ ae _ 6 sin (27) 
Dig = i sin’ v sin (bv) dv = @— bd — &) 
27 ] 
se 6 sin (27b) 
Dy = i sin” v cos v sin (bv) dv = 6 — 8) (4 — B) 
. gs ie ese 


Dy = i sin v cos (bv) dv a 


6[1 — cos (2rb)] 


Qr 
4 = 7 3 , = 
Dy = [ sin’ v cos (bv) du O- Bd) 


-- 6[1 — cos (2rb)] 





Dy» = [ sin’ v cos v cos (bv) dv = 6 bya — 

Da i. sin’ v cos’ v cos (bv) dv = ae 

Dy = ie sin’ v cos’ v sin (bv) dv = eee 

Dos = i‘ sin’ v cos’ v cos (bv) dv = = ee Oe 
_ (10 — 0°) sin (2nd) 


sin v cos’ v sin (bv) dv = 


S 
II 
om 


“(16 — 8) (4 — 8) 
a sin’ v cos’ v sin (bv) dv = a SE ve 
(10 — b’)[1 — cos (2rb)] 

(16 — B)(4 — & 





S 
II 
on 





Qn 
* 3 
Dog = [ sin v cos v cos (bv) dv = 
JQ 
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24{1 — cos (27b)] 
b(16 — b?)(4 — Bb?) 
24 sin (27d) 

b(16 — b7)(4 — B?) 
__ 6(5 — b*) sin (27b) 

(25 — b?)(9 — 6?)(1 — 6?) 
6(5 — b’)[1 — cos (2xb)] 
(25 — b?)(9 — b7)(1 — B?) 
= 6 sin (27b) 

(36 — b?)(4 — 6B?) 


ge See ~  6[1 — cos (2rb)] 
ip eee ie _ : at es 
34 » Sin!» cos’ » cos (bv) dv Gé- GB) 


Qqr 
Do = sin‘ v sin (bv) dv = 
0 


Qa 
Ds = sin‘ v cos (bv) dv = 
0 


27 
. 3 2 5 
Du = [ sin’ v cos uv sin (bv) du 
0 


Ir 


27 
- 3 2 ; 
Dy. = [ sin’ v cos v cos (bv) dv 
0 


27 : 
+228 3 ; 
D33 = [ sin’ v cos’ v sin (bv) dv = 


2r i 
7 ee ok foe 120 sin (27b) 
D3 = [ sin’ v sin (bv) dv = 25 — B)9 — BA — &) 
2 
ee alee _ 120[1 — cos (27b)] 
Dsgp = [ sin’ v cos (bv) dv = —-AHO-wA-B 
2 
7 24b[1 — cos (27b)] 
etl as Sig oe ee) 
37 , (Silt » cos » sin (bv) dv (25 —b)(9 —-BUd —B) 


24b sin (2rb) 


Qa ‘ 
Das =f” sin » c08 vc0s (Uo) do = ————2#b sin (Zed) 
a , ‘Silt ¥ cos v co (bv) dv (3 —-H)0-HA-B) 








2a : 
24(6 — b°)[1 — cos (27b)] 
D = [ 4 F b — 
39 : SsIn v cos v sin ( v) dv b(36 = b?)(16 = b2) (4 a b?) 
2 2) sj 
of ey ; 7 24(6 — b°) sin (2zb) 
Qr ; . 120 sin (2b) 
ar 5 SS os Or 
Da = [ sin’ v cos v sin (bv) dt (36 — 8) (16 — bd) 4 —B) 


120[1 — cos (2rb)] 
(36 — b)(16 — b2)(4 — Bb)” 





27 
* 5 
Dy = : sin’ v cos v cos (bv) dv = 
0 . 
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Directional Control in 


Light-Wave Guidance 


By S. E. MILLER 
(Manuscript received May 11, 1964) 


The transmission of light waves for communication in a medium sheltered 
from atmospheric effects requires wave guidance providing frequent changes 
in direction of propagation. This paper shows that, in any electromagnetic 
waveguide having transverse planes in which the field ts essentially eguiphase, 
the transverse width of the field distribution 2a and wavelength d determine 
the order of magnitude of the direction-determining parameters, Rimin, the 
minimum bending radius, and b naz, the maximum abrupt angular changes, 
according to the relations 


Ruin = 2(0°/®) 
Smaz = 3(A/a) 


which are valid in the region X < a. The significance of Rinin ts apparent, 
with the note that in a system containing a multiplicity of bends, an approprt- 
ate way of summing the effects of the individual bends should be used to es- 
tablish an over-all equivalent bend radius for the complete transmission 
path, which must be larger than Rmin . The quantity dmaz may be regarded as 
the maximum value of the accumulated angular errors (rms sum, for ex- 
ample) in a transmission line including reflecting or refracting elements for 
directional control. For a light beam at \ = 0.6328 microns having a diame- 
ter of 1.0 mm, bmaz = 0.036° and Rinin = 600 meters. 

Small-diameter beams ease the problem of directional control. There 1s no 
fundamental reason why small beams should not be achievable with low loss 
in the straight condition, but many guiding structures do have an inverse 
relation between beam diameter and straight-condition attenuation coefficient. 
To explore the direction-controlling properties of specific media and the in- 
teraction of Rinin ANA bmaz With straight attenuation coefficient, the following 
waveguides and associated criteria for establishing Rinin aNd Smaz Were 
studied: 

(1) sequence of lenses: criterion, beam deflection from nominal axis by 
one beam radius, 


1727 


1728 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1964 


(2) hollow dielectric waveguide: criterion, added bend loss equal to 
straight condition loss, 

(3) round metallic circular-electric waveguides: for helix guide, criterion 
ts bend loss equal to straight loss; for simple metallic tube, criterion is a 
transmission ripple (due to mode conversion) of about 1.7 db. 

In all cases the functional dependence on a and d for Rinin OF Smaz Was the 
same (given above) as derived for the generalized electromagnetic waveguide, 
and the associated constants were in most cases of sumilar magnitude. 


I. INTRODUCTION 


In research on techniques for transmitting light waves over appreciable 
distances for communication it has become evident that control of direc- 
tion of propagation is an important and difficult problem. Electromag- 
netic waves in free space travel in a straight line. In a medium that is 
sheltered from atmospheric effects, frequent changes in direction are 
necessary to follow vertical terrain contours and to conform to a hori- 
zontal path avoiding physical obstacles and regions of high-cost installa- 
tion. The wave guiding medium must provide these direction changes. 

In this paper some simple relations are derived to give the order of 
magnitude of the direction-determining factors, bending radius and 
abrupt tilt angle, for any wave guiding structure as a function of wave- 
length and the transverse dimension of the guided electromagnetic 
wave beam. These simple relations are then compared to the correspond- 
ing more precisely defined quantities for specific waveguides: (1) a se- 
quence of lenses,! (2) the hollow-dielectric waveguide,® and (3) round 
waveguides for circular electric waves. 


II. DERIVATION OF GENERAL WAVEGUIDE DIRECTIONAL SENSITIVITY 


In Fig. 1 we show a generalized waveguide for electromagnetic waves, 
with an abrupt open end radiating into free space. We assume the field 
at the aperture is essentially equiphase, which implies ending the guide 


WAVEGUIDE ea 





Fig. 1 — Waveguide with abrupt open end. 
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only at certain longitudinal locations if a periodic form of guidance (such 
as a sequence of lenses) is employed. Let the field strength variation 
across the aperture be approximately sinusoidal. Then, approximately, 
the far-field beam angle @ is 


6 = d/a radians (1) 


in which we require a > i. Other aperture distributions would give the 
same order of magnitude for 6. In the near-field region the radiated beam 
remains collimated in a width approximately 2a out to a distance 2, from 
the aperture, where 


26 ==. 20 (2) 
Zo = 2a’/n. (2a) 


The key inference on directional sensitivity is introduced here. Since in 
the absence of the guide the beam remains confined to essentially the 
same region as in the presence of the guide, it is concluded that the guide 
has little influence on the beam over the interval z, . Thus any appreciable 
change in direction of wave propagation must not be made in a distance 
less than 2, . 

With reference to Fig. 2, the departure of a circular arc from the 
tangent is 


A = 4(P/R). (3) 


We now require that A = a when / = z,. Using (3), (2) and (1), we 
obtain the minimum bend radius Rmin 


Rin = 20°/d. (4) 





Fig. 2— Departure of a circular are from the tangent. 
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Alternatively, this same relation may be arrived at by specifying that 
the change in direction shall be one beamwidth 6 after traveling a dis- 
tance z, in the minimum bending radius Rin ; 1.e., Rmind = 2 - 
Equation (4) gives the order of magnitude of bend radius at which 
the wave propagation will change character. At longer bend radii the 
wave propagation will be essentially as in the straight guide, and at 
shorter bend radii something drastic will happen. Just what changes oc- 
cur in the latter case depend on the nature of the medium in detail. If 
the medium is enclosed in a perfect conductor the change will be large 
mode conversion. If it consists of a sequence of infinitely wide lenses we 
will see that the change is a wide oscillation of the beam about the nomi- 
nal axis of propagation. Note that in neither of these cases is energy lost 
due to the bend. Nonetheless, we regard either change as undesirable. 
Consider an abrupt angular change in the guide direction, 5. Following 
a line of reasoning analogous to that given above, we can say that the 
character of wave propagation will change rapidly in the region where 


Ona = 0/2 = dj 2a: (5) 


Smaller values of 6 will cause progressively less change in wave character, 
whereas larger values of 6 will cause violent changes. 

If we consider the relation of these quantities to a wave guiding me- 
dium, it is apparent that R,,in is intended as the smallest radius at which 
the otherwise uniform medium can be bent. When a multiplicity of bends 
is included in a single transmission link, some way of summing their ef- 
fects is needed to form an equivalent bending radius which must be 
greater than Rinin . 

The angle 6 is somewhat different. In many media where a > d it is 
possible to insert a large plane reflector and introduce a change of direc- 
tion of arbitrary size. As long as the guides at both approaches to the 
reflector are perfectly aligned according to geometric optics, the dis- 
turbance on wave propagation may be negligible. However, there will be 
an error in such angular alignment and 6max tells us how large that error 
may be. When many random angular errors are made, dmax 1S aPproxi- 
mately the rms accumulation of such errors. 

The numerical values of Rmin and dmax have been plotted for \ from 
0.6328 to 10 microns and beam radius a from 0.1 to 100 millimeters in 
Figs. 38 and 4, respectively. For example, at ) = 0.6328 microns and 
2a = 1.0 cm, Rmin = 600,000 meters and Smax = 3.6 X 10° degrees. 
Dropping to 2a = 1.0mm, Rmin = 600 meters and bmax = 3.6 X 107 
degrees. 

We consider next certain specific wave guiding structures to compare 
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Fig. 3 — Beam radius vs minimum waveguide bend radius. 
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Fig. 4 — Beam radius vs maximum angular error. 
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the results of directional changes in those structures to the generalized 
conclusions drawn above. 


III. SEQUENCE-OF-LENS WAVEGUIDE 


G. Goubau has proposed! a waveguide for electromagnetic waves con- 
sisting of a series of lenses, and D. Marcuse has used geometric optics to 
determine the effects of bends in such a waveguide.” 

If the input to a lens waveguide is a ray which is inclined at an angle 
6 to the longitudinal waveguide axis (Tig. 5) the departure of the ray 
from the longitudinal axis has a magnitude at successive lenses which is 
contained within an envelope which is a sinusoidal function of distance 
along the longitudinal axis. Starting with the work of Marcuse, one can 
show that there is an optimum strength of lens which minimizes the de- 
parture of a ray from the axis; the optimum focal length f is related to 
the lens spacing L by 


of = ih (6) 


and under that condition the maximum deviation of the ray from the 
longitudinal axis is 


Tmax—1 = 6b. (7 ) 


Consider a region of bend radius F following a straight region of lens 
waveguide. For a ray incident on the curved region from the axis of the 
straight region Marcuse has also calculated the ray’s departure from the 
axis in the curved region; for the case f = L/2 the maximum departure is 


Tmax-2 = L’/R. (8) 


We now relate these departures from the guide axis to the transverse 
dimension of the beam. It is convenient to consider the beam radius to 
be that value of radius beyond which a completely negligible amount of 








Fig. 5 — Sequence of lenses. 
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field exists. We define this beam radius as 
ry = (Nol dX)? (9) 


Here Np is the Fresnel number which previous work*” shows is on the 
order of unity for negligibly small diffraction loss when all energy out- 
side the radius 7 is absorbed at the lenses.” 

As a criterion of the maximum permissible abrupt angular change, we 
somewhat arbitrarily set it to be that angle at which 7yax_1 is equal to 
the transverse beam radius 1 : 


Smax = (%/L) = (Nod/L)*? = Nod/ro- (10) 


Since No = 1, this specific guide and criterion gives a permissible angular 
change of twice that prescribed by (5). This may be considered an ex- 
cellent agreement. 

As a criterion of the minimum permissible bend radius, we set the re- 
sulting beam deflection 7max_2 equal to the transverse beam radius 7; : 


Ros. = L?/r, = re’ /(Nod)’. (11) 


Since No & 1, this specific guide and criterion gives a permissible bend 
radius of one-half that prescribed by (4), which again may be considered 
excellent agreement. 

The most important aspect of the comparison between (4) and (5), 
(10), and (11) is that the corresponding equations have the identical 
dependence on \ and a, which determines in a broad way the magnitude 
of the direction determining parameters. 

In this form of guide we can readily relate the beam radius to the as- 
sociated lens spacing and the losses. Fig. 6 shows the lens spacing L 
versus beam radius in the 0.5- to 4-micron wavelength region. As before, 
No will be about unity, but where extremely low losses per lens are re- 
quired may have to be slightly greater than unity.’ For the 1.0-mm beam 
diameter referred to above, the lens spacing is about 0.4 meter for 
\ = 0.63 microns. 

In principle, vanishingly small transmission loss could be obtained by 
appropriate choice of lens diameter (1.e., choice of No), if the reflection, 
absorption, and scattering losses were negligible at the lenses. In practice, 
such losses may be very real. Fig. 7 shows the total losses per lens re- 
quired as a function of lens spacing with net transmission loss as a 
parameter. For 3 db/mile net loss and the 0.4-meter lens spacing, a power 
loss per lens of about one part in 10° is required. 


* Further discussion of No and rz is given in the Appendix. 
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Fig 6. — Approximate beam radius vs lens spacing in a sequence of identical 
lenses. 


In general, of course, the smaller beam diameters which permit rapid 
direction changes require more tight guidance (closer lens spacing) and 
tend to increase the losses. Note, however, that the only inherent losses 
associated with tight guidance for the lens guidance system are due to 
scattering or reflection at lens surfaces or bulk lens absorption loss, both 
of which may conceivably be made very small. 


IV. HOLLOW DIELECTRIC WAVEGUIDE 


E. A. J. Marcatili and R. A. Schmeltzer have proposed a waveguide 
for light waves consisting of a hollow dielectric tube in which the useful 
energy is entirely confined to the central hole. When the guide is 
straight, loss takes place through very slow radiation into the dielectric, 
which is completely absorbing for the light energy. 

The bending radius for such a guide which makes the extra loss due to 
bending (also a radiation loss) exactly equal to the straight-guide at- 
tenuation coefficient has also been determined.’ They find, for the lowest- 
order mode (EHu), in which the field varies roughly cosinusoidally 
from the axis to the inner wall of the tube, 


Ruin-p = 9.5 0°/¥ (12) 
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Fig. 7 — Lens loss vs spacing for prescribed total loss. 


where a is the inner radius of the tube. Once again, the functional de- 
pendence of Rmin—p on a and 2 is identical to that in (4). 

The straight-line attenuation coefficient for the lowest-order mode 
can be reduced to” 


as = 0.214 */a° (13) 


for an index of refraction of the dielectric tube equal to 1.5. As Marcatili 
and Schmeltzer have pointed out, the dependence of a, on a and d is the 
exact inverse of that for Rinin_p ; hence for any prescribed straight-guide 
loss there is a minimum permissible bending radius, which for the lowest- 
order mode is 


Rmin-p = 2.03/a; . (14) 


Yor fixed a, this is independent of X. For Rnin-p = 1000 meters, a, = 
0.002 nepers/meter or 27.9 db/mile, and at \ = 0.6328 microns, the 
inner radius of the tube a = 0.35 mm. 


V. CIRCULAR ELECTRIC WAVEGUIDES 


We consider now the directional control relations for round waveguides 
designed for the TE circular electric wave. Helix waveguide and smooth- 
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walled metallic waveguide involve quite different criteria for tolerable 
bending radius and will be discussed separately. 

Consider first metallic guides in which the losses are negligible com- 
pared to the mode coupling coefficients. We assume the degeneracy be- 
tween To and TMy is broken with a dielectric lining or other guide 
modification. Then the limit on bending radius or abrupt tilt is the inter- 
fering effect between the unperturbed TI; energy and the energy which 
is converted to an undesired mode and reconverted back to the TE 
wave. 

For an abrupt tilt, the amplitude conversion coefficient from TE 
to TEy. was found by 8. P. Morgan‘ to be, approximately 


p = 1.935 (a/v)é (15) 


where 6 is the tilt angle in radians. When converted energy from one tilt 
strikes another tilt (presumed for simplicity here to be the same angle), 
energy is reconverted back to the TE: wave with the same conversion 
coefficient given by (15). The amplitude of the reconverted wave com- 
pared to the unperturbed wave is then p*. Depending on the relative 
phase of the reconverted vector, it may add at any phase angle to the 
unperturbed wave. Hence the amplitude transmission coefficient varies 
with wavelength between (1 + p’) and (1 — p’). Letting a transmission 
fluctuation of 1.7 db, corresponding to p” = 0.1, be the criterion of limit- 
ing tilt angle, we find 


Smax-m = Dd/1.935a = 0.164 d/a. (16) 


Comparing, this to the generalized relation for 6max in (5), we note the 
identical dependence on \ and a and a somewhat smaller constant multi- 
plier. In practice, the existence of coupling to other modes would tend 
to make somewhat smaller values of 6max—ar needed, but the dependence 
on A and a would not be affected. 

Still considering guides with negligible losses, we examine the effect 
of a constant-radius bend. It may be shown that the reconverted vector 
has the magnitude 


pr = ke /(T; — 12)” (17) 


where k, is the distributed coupling coefficient between TE: and an un- 
desired mode and T, and I, are the propagation constants for TE: and 
the undesired modes, respectively. For TE: to TEy2 


ki & j(2a/oR) (18) 
where ? is the bend radius and a is the radius of the guide. Also, 


(T, — Te) & 7(6i62) = 7 do/a’. (19) 
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Hence 
pi = 4a°/ Rn‘. (20) 


As a criterion for minimum bend radius we set py equal to 0.1, giving 
the same ripple in transmission loss as noted above, with the resulting 
bending radius from (20) 


Rmin-u = (4/p’)*(a°/n’) = 6.3 a°/d. (21) 


Comparing (21) to (4), we again find the identical dependence on a and 
d with a slightly different constant. 

Turning now to the case of helix waveguide in which very strong loss 
is introduced for the undesired mode, we find we do not have explicit 
forms for the coupling coefficient. We take advantage of some numerical 
evaluations carried out in the 30- to 100-kme region on guide varying in 
diameter from 0.25 inch to 3 inches. It was found that the bend loss 
coefficient is given by the expression* 


= 0.0726 (a°/RX”"). (22) 


The TE: loss of the guide when straight is very nearly that of a copper 
cylinder, given by 8. A. Schelkunoff as° 


= 4.46 X 10°° A?/a°. (23) 


In (23) we have assumed a? > (/2)’, so that the cutoff effect is negligi- 
ble. As our criterion for minimum bending radius we equate the bend 
loss az and the straight-line loss a; , yielding 


Rmin-w = 128 a°/n". (24) 


The functional dependence of Ryin_y on \ and a is very nearly the same 
as in (4), but the constant multiplier is much greater. This is a conse- 
quence of the criterion a, = ag, which is thereby proven much more 
stringent than the rather lax transmission ripple criterion used above for 
the metallic tube guide in which dissipation was negligible. Since a, of 
(23) and Rmin_nx of (24) have different dependence on \, a change of 
wavelength will influence Ruin_x even though a, is held constant. We 
can express this by substituting (23) into (24), giving 


128 a 5.71 X 10% 


nO-6 “SE V-Sa ; (25) 


Ruin-z = 


At longer wavelengths, smaller bending radii are tolerable even though 





* This is the result of unpublished calculations by the author, based on cou- 
pling coefficients derived by methods due to Unger® and using coupled-wave 
theory.?® 
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a, is held constant by increasing a. At a wavelength of 5 mm and a 
straight attenuation coefficient of 1 db/mile, Rmin-y = 191 meters. This 
result and the numerical constant in (22) are dependent to some extent 
on the wall impedance to the undesired modes used in the numerical 
evaluations referred to above, which was on the order of one-half the free- 
space intrinsic impedance. 


VI. CONCLUSION 


For any guided electromagnetic wave, the order of magnitude of the 
direction-determining parameters Ruin (the minimum bending radius) 
and bmax (the maximum abrupt angular change) are uniquely determined 
by the wavelength and transverse beam. dimension. Equations (4) and 
(5) were derived, determining Rmin and 6dmax for a general guided elec- 
tromagnetic wave by inferring the tightness of guidance from the be- 
havior of a wave radiated from the open end of the waveguide. Investiga- 
tion of specific forms of waveguide with precise criteria for setting limits 
on R and 6 (as outlined in the abstract) lead to identical functional 
forms for Rin and dmax , With similar constant multipliers. 


APPENDIX 


Previous workers’ have calculated the diffraction loss at a reflector 
in a maser interferometer, and the same loss per lens would be expected 
in a sequence-of-lens waveguide if the entire plane outside the edge of the 
lens (of radius equal to that of the maser reflector) were absorbing. These 
losses are plotted versus N = a’/LX (where a is the reflector radius) in 
Fig. 3 of Ref. 7 and in Fig. 15 of Ref. 3 for focal length f = L/2. We 
chose No to be that value of N which gives satisfactorily low loss per lens; 
for example, for No = 1, Ref. 7 gives a power loss per lens of one part in 
10* for the lowest-order wave, and for Ny & 1.4 the power loss is one 
part in 10°. Fig. 3 of Ref. 1 shows that 99.8 per cent of the energy of the 
normal mode for infinite lenses lies within the radius r = (L\)? at the 
lens. In practical cases, therefore, No will differ little from unity. 

Another item of interest is the relation between 7, of (9) and the field 
amplitude given by previous workers.’” The field varies as a function of 
radius 7 from the axis of the guide according to 


exp (—r’/w’) (26) 


where w is the radius at which the field drops to e™ of its maximum (on 
axis) value. The value of w varies with longitudinal position between 
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lenses; at midway between lenses w = wo, where 


wo = (LA/2r)*. (27) 
At the lenses, w = w, , and for our cases of f = L/2 
ws = (Ld/r)*. (28) 
It is apparent from (9) and (28) that 
ry = we (Nor)? (29) 
In terms of w, (10) becomes 
and (11) becomes 
Ruin = (4!No *) (ws'/d’). (31) 
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Alternating-Gradient Focusing and 
Related Properties of Conventional 
Convergent Lens Focusing 


By S. E. MILLER 
(Manuscript received June 8, 1964) 


A series of lenses whose focusing properties are alternately convergent and 
divergent (alternating-gradient focusing) is of potential interest in the guid- 
ance of light waves, and has previously been used to focus electron beams and 
high-energy particle streams. New information is provided herein on such 
focusing for the case of equal focal length f (but alternating-gradient) lenses 
equally spaced a distance L. 

The alternating-gradient system formed by adding diverging lenses between 
the lenses of an all-converging sequence of lenses is found to have the same 
stability condition as the original system for 0 < L/f < 2. A physical argu- 
ment leads to the conclusion that weaker divergent lenses would also leave the 
stability criterion unchanged. 

The focusing effect of the allernating-gradient system is surprisingly close 
to that of an all-converging lens system. After the focal length of each has been 
adjusted to an optimum value, the ray departure from the system axis is only 
1.67 times as great for the alternating-gradient system as for an all-convergent 
lens system with the same spacing of convergent lenses. 

For weak lenses (i.e. 2f/L >> 1) the output ray departure due to input ray 
displacement is independent of both the focal length and spacing of the lenses, 
and 1s independent of lens spacing but proportional to focal length for input 
ray slope. 

Both the alternating-gradient system and all-convergent lens focusing ar- 
rangements exhibit discontinutties in the maximum ray displacement ver- 
sus focal strength relation. 

Viewed over-all, alternating-gradient focusing for light guidance does a 
surprisingly efficient job and may be advantageous over all-convergent lens 
systems af the alternating-gradient arrangement has structural or economic 
advantages. 
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I. INTRODUCTION 


In research on guidance of light waves for communication we are con- 
sidering use of a sequence of lenses of alternately convergent and diver- 
gent types. For example, a guidance system using tubular thermal gas 
lenses might employ continuous flow of gas through a tube whose walls 
are alternately warmer and cooler than the gas within. Thus, the mech- 
anism used to cause the focusing may have the alternating character, 
and the question comes to the fore — how well can one focus with such 
a structure as compared to the use of a sequence of all-convergent lenses? 

Alternating-gradient focusing has previously been used on electron 
beams! and on particle accelerators.2 The present study discovered an 
error in the previous determination of stability conditions and revealed 
some little known but interesting properties of alternating-gradient focus- 
ing. A comparison is made with conventional focusing using all-conver- 
gent lenses. 


II. ANALYSIS OF ALTERNATING-GRADIENT SYSTEMS 


One might wonder whether a series of equal-focal-length and alter- 
nately converging and diverging lenses would give any net focusing at 
all, since the average dielectric constant along all paths parallel to the 
axis would be the same. It is well known, however, that a divergent 
lens followed by a convergent lens of equal focal length, spaced a finite 
distance less than the focal length, gives a net converging lens, and the 
same is true if the order of the lenses is reversed. Thus, a net focusing 
is to be expected for an infinite series of such lens pairs.! 

We consider a sequence of alternating convergent and divergent lenses 
equally spaced a distance Z and of equal focal lengths, f. We follow the 
method of analysis used by Pierce.’ There are two cases to cover, one in 
which the first lens of the array is a divergent lens (obtained by starting 
at n = 0 in Fig. 1) and the other in which the first lens of the array is a 
convergent lens (obtained by starting at N = 0 in Fig. 4, below). 

With reference to Fig. 2, and taking the input ray at plane a to have a 
slope ra’ and a displacement r, from the longitudinal axis, the output ray 
from the lens at plane 6b will be 


Tt = Ta + Lrd’ (1) 
rm = 1a + (1/f)ro. (2) 

At plane c this ray will be described by 
To = to + Lr,’ 39) 


t = 7). (4) 
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Fig. 1 — Sequence of lenses — first case. 


Eliminating r, and 7,’ from (3) and (4) gives 





=(144)n+4(24+2)0 (5) 
DL ! Ta 
=(1+ 4), TF. (6) 
Hence, with reference to Fig. 1, we can write 
Tr = ( Sle ") Tn + L (2 + ") Tn. (7) 
Tne L , lr _ Tn4i 
Tn = (1 + 2) Tn ++ 7 f . (8) 


These two equations lead to 


Tn4+2 2 = (2) | Tn41 2 Tr = 0. (9) 


The solution to (9) is 
r, = A cosné + Bsin nO (10) 


| 
| 
| 
| 
es 





Fig. 2 — Lens subsection for Fig. 1. 
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6 = cos” E — (7) | (11) 


and where A and B are constants to be determined. 

Equation (11) differs from the corresponding equation on page 200 of 
Ref. (1), which is believed to be in error. The correct condition for 
stability, from (11), is 


where 


0 < i(L/f)’ <2 (12) 
or 
0<L/f < 2. 
We put in the boundary conditions, at n = 0 
Tn = 10 (13) 
he ts (14) 


We make use of a general theorem* stating the orthogonality of the effects 
of ro and ro’ and seek a solution with those quantities as factors. This 
leads to the following form for 7, , using (10), (7), (13) and (14): 


Tn = Toki cos (nO — gi) + 7o'L ke sin nO (15) 
where 
9 3 
= | ame] ai 
¢g1 = | cos ky? | (17) 
— (02f/L)+ 1] _ 2+ (L/f)] 
"= HE sin as 


The general form of k; and ke versus L/f is shown in Fig. 3. Further dis- 
cussion will be postponed to a later point in this paper. 
We are also interested in the displacement 7, at the output of the 
mth diverging lens (Fig. 1). Using the relation 
tm = Tn + Lr,’ (19) 
and using (7) for r,’, (15) for r, with appropriate trigonometric rela- 
tions, it can be shown that 
lm = Yok3 cos (MO — v3) + ro L ky cos (mé — ga) (20) 
* See Appendix B. 
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Fig. 3 — Coefficients relating input ray slope and displacement to ray displace- 


ment at the nth, mth, Nth, and M/th lenses of the alternating-gradient lens sys- 


tems of Figs. 1 and 4. 


where 


(21) 


am] 


a 


(22) 


3 | 


| cos” k 


p= 


(23) 


ky = 2f/L 


(24) 


and 6 is again defined by (11). Plots of ks and ky are given in Fig. 3. 


—l 


vs = | cos ky" | 
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Equations (15) and (20) give the ray displacements at any lens in the 
system when the input is at a plane adjacent to a converging lens (i.e. 
at n = 0, Fig. 1). Before discussing interesting features of such ray prop- 
agation we will give the corresponding solutions for the case where the 
input is adjacent to a diverging lens (i.e. at N = 0, Fig. 4). 

With reference to Figs. 4 and 5, and following a derivation similar to 
that carried out in connection with equations (1) to (9), it is found that 


Trae — [2 — (L/f)"] tra ty = 0. (25) 


Note that (25) is identical to (9) and with the change of n into N the 
solution for (25) is again (10) and (11). When the initial conditions are 
put in, at N = 0, ry = mandry’ = 1, we get 


ry = rok, cos (NO + os) + ro’Lke sin NO (26) 

where 
pp = 1D — I | 
vs = |cos ks’ | (28) 
be eh) 

Note that i; is identical to kz in (21). 
For rar we find 

rar = rokz cos (MO + 7) + ro’Lkg sin (M0 — gs) (30) 

where 
a on 

1 — (L/2f) 

gi = |cos ky | (32) 
kg = 2f/L (33) 
gs = |cos kg |. (34) 


Note that ks and gs are identical to ky and g, , and that k; is identical to 
ky . Plots of ks , ke , ky , kg are given in Fig. 3. 


III. RELATIONS FOR A SEQUENCE OF CONVERGING LENSES 


For comparison purposes we will want to refer to the case of a se- 
quence of identical convergent lenses equally spaced. The analysis is 
similar to that above for the alternating gradient lenses. The results are 
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Fig. 4 — Sequence of lenses — second case. 


as follows. Let the lens spacing be s, and the focal length be f. Then the 
displacement r, at the pth lens is 


Tp = Tok cos (pd — go) + 10's kyo Sin pd (35) 
where 

_[__4f/s ; 
kg = | tet | (36) 
gy = | cos ky * | (37) 

_ Gis 

be = = CADP” in es 
6 = cos [1 — (s/2f)] (39) 
s/f = 2(1 — cos6). (40) 


The system is stable in the sense that an input displacement 7» or slope 
ro will remain bounded as p is increased if 


0 <s/f <4. (41) 
Fig. 6 shows the values of ky and kyo for comparison to Fig. 3. 

A B Cc 
| | | 
| 
| | 
| | 
| | | 
eee eee 


Fig. 5 — Lens subsection for Fig. 4. 
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Fig. 6 — Coefficients relating ray slope and displacement to ray displacement 
at the pth lens of an all-convergent lens system; f = focal length, s = lens spacing. 


IV. STABILITY COMPARISON 


Since the converging lenses in Fig. 1 or Fig. 4 are spaced a distance 2L, 
a comparison of (41) and (12) shows that the alternating-gradient system 
formed by adding a divergent lens halfway between the convergent lenses 
of an all-convergent lens system has the same stability condition as the 
original system. At a later point it will be shown that this is reasonable 
physically. 


V. THE WEAK LENS CASE 


When the lenses are weak, i.e., when 2f/E > 1, the general expres- 
sions may be simplified to show some remarkable properties of alternat- 
ing-gradient focusing. When the first lens is a diverging one, (15) and 
(20) yield 


Tm = Tn = 1 V2 cos [nd ~— (r/4)] + 10'2f sin nO (42) 
and when the first lens is a converging one, (26) and (30) yield 
ty = Tu = 1 V2 cos (NO + (2/4)] + 70'2f sin NO. (43) 


These expressions show that for an input ray displacement without 
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slope the maximum displacement of the transmitted ray (as n or N 
varies) is +/2 times the input ray displacement, independent of both 
focal length and lens spacing! Also, for an input ray of zero displacement 
but finite slope 7o’, the maximum displacement of the transmitted ray 
is 2fro’, independent of lens spacing L. The angle 6 is dependent on f 
and L and goes to zero as f— ~. 

In an all-convergent lens system the similar condition 4f/s >> 1 leads 
to [from (35)] 


rp = 1 COS pb + ro V/fs sin ps. (44) 


In comparing the alternating-gradient system to the all-convergent 
lens system for weak lenses, we see that for an input ray with zero slope 
ro’ but finite displacement, ro, the maximum output displacement for 
the alternating-gradient system is 1/2 times that of the all-convergent 
lens system. For input ray displacement 7» = 0 but finite 79’, we see that 
the maximum output displacement for the alternating-gradient system 
is larger than for the all-converging lens system by the factor [see (42) 
and (44)]: 


2fro 
(fs)*ro" 


Our assumption of weak lenses made 4f/s > 1, so (45) is a factor of two 
or more. 

In this weak lens case both @ and 6 are small angles, and from (11) 
and (39) 


= (4f/s). (45) 


6=L/f (46) 
6 & (s/f)’. (47) 


Using the case of s = 2, which is the alternating-gradient system formed 
by adding a diverging lens between the lenses of an all-convergent lens 
system 


9/6 = (8/4f)’. (48) 


Since 4f/s > 1 by our weak lens definition, 6/6 is less than unity and the 
period of the alternating-gradient system encompasses a great many 
more convergent lenses than does the all-convergent lens system with 
the same spacing of convergent lenses. This is as would be expected. 


VI. OPTIMUM FOCAL LENGTHS 


We now inquire as to whether there is a best value for the lens strength 
in order to minimize output ray displacement. On the assumption that 
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the sine and cosine terms of (15), (20), (26), (30) and (35) go through 
unity for some number of lenses, the question is whether or not the 
coefficients ki, ke --+ kio have any minima. 

For the all-convergent lens system Fig. 6 illustrates that ky has no 
useful minimum but that ky, relating input ray slope to output ray 
displacement as in (35), does have a minimum. By setting 


d 
df (kyo) = 0 (49) 
we find 
8 
= 5 
T | ont yy (50) 


at which condition kip = 1.0. We note that the displacement r, due to 
ro’ iS To Sky Sin pd, so we have a minimum in this displacement when hy 
is a minimum provided sin pé goes through unity for some number of 
lenses p. This is the most typical case, but there are notable exceptions. 
Suppose, for example, that 6 of (39) is 7/8, corresponding to s/f = 1; 
then pi = 2/8, 27/8, 7, 42/3, etc., as illustrated in Fig. 7, and | sin pé | 
never exceeds sin 6. Hence the maximum value of 79’s kip sin pé is 10's 
for s/f = 1. It is shown in Appendix A that there is an infinite series of 
such discrete values, but the largest departure of the maximum value of 
ro ski sin pé from ky is 15 per cent, occurring at s/f values of 1 and 8, 
as illustrated in Fig. 14 (see Appendix A). 

Turning now to the alternating-gradient system, the only coefficient 
having a useful minimum is ke of (15), relating input ray slope to ray 
displacement at the converging lenses of Fig. 1. We find the minimum in 
ke by setting 


d 
df (ko) = 0 (51) 
which leads to the equation 
(L/f)’ + 4(L/f)* — 8 = 0. (52) 


/ SIN pd 
/ 





Fig. 7 — Diagram of sin pé, p = 1,2,8 --- , when 6 = 7/3. 
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The appropriate root of this equation is 


ae 
f opt 


at which we calculate 6 = 76.4°, and ky = 3.33. Fig. 3 shows this mini- 
mum is sharper than the corresponding one for kyo of the all-converging 
lens system, Fig. 6. We note that k, sin né of (15) contains the sin né/sin 
6 factor, so once again (as described in Appendix A) for 6 = 7/3 and other 
values, the maximum value of k, sin n@ will be somewhat less than the 
value of ke. 

It is important to compare the optimized focusing effect of the alter- 
nating-gradient system to that for the all-convergent lens system. We 
make the comparison on the alternating-gradient system formed by 
adding a divergent lens of equal focal length in between the lenses of an 
all-convergent lens system; then we have s = 2L. The optimized maxi- 
mum displacement due to input ray slope is 


= 1.237 (53) 


10'S = 2r)/L 
for the convergent lens system, and is 
3.00 ro L 


for the alternating-gradient system. It is remarkable that the focusing 
effect of the alternating-gradient system is so nearly the same as that of 
the all-convergent lens system. In practice it may be advantageous to 
get the focusing action in a manner that inherently reverses itself period- 
ically. This analysis shows that such structures are nearly as effective as 
those wherein the focusing effect is always convergent. 

Tig. 3 shows that the focal length which is optimum with respect to 
the input ray slope (kz) is also an acceptable region with respect to input 
ray displacement (k; and ks). 


Vil. RAY PATHS 


One can get a useful physical feel for the wave propagation by tracing 
the rays in a few of the important cases. 

For the all-convergent lens system optimized according to (50), Fig. 
8 shows the ray paths for a zero-slope finite-displacement input ray and 
for a zero-displacement finite-slope input ray.” Here 6 = 90° [see (35)] 
and a period is completed in 4 lenses. As proved in Appendix B, the 

* Note that ky = 2, with s = 2f in (86), but ky cos (ps — ¢s) is always +1 
for any p. (The angle g, = 45°.) This is an example of the caution that must be 


exercised in regarding the k’s as maximum values of the various terms in fr, , Tm , 
Tp , etc. 
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response to an arbitrary input ray can be obtained by a linear super- 
position of the responses shown in Fig. 8. 

For the alternating-gradient system the optimum according to (53) 
corresponds to an angle @ in (15), (20), (26) and (30) of 76.4°, which 
makes the ray path periodic only at a very large number of lenses. How- 
ever, a very useful feel can be obtained from the ray paths for @ = 90°, 
corresponding to L = +/2f and giving a value of k, only slightly larger 
than the minimum value (3.414 compared to 3.33). These ray plots are 
shown in Figs. 9 and 10 for the two types of input rays at the two pos- 
sible points in the alternating-gradient system. We note that input ray 
displacement causes the same maximum displacement in the response 
regardless of where it occurs. Input ray slope is much more serious when 





Fig. 8 — Ray paths in the confocal all-convergent lens system, s = 2f. 


it occurs in front of a diverging lens than when it occurs in front of a 
converging lens. Again, the response to arbitrary input rays can be ob- 
tained by adding the plotted responses. 

One can gain a little feel for the stability comparison made previously 
by looking at Figs. 11 and 12. Even though the 7» term and the 7’ term 
of (85) for the all-convergent lens system go to infinity individually 
when s = 4f, a suitable combination of input ray slope and displacement 
remains bounded and this is illustrated in Fig. 11.* Any reduction in 
focal length f causes instability, and any increase in f leaves the system 
completely stable. It is clear that adding a lens of any kind at the mid- 
point between lenses in Fig. 11 will not alter the propagation of that ray. 
In Fig. 12 we see that adding a divergent lens in between the converging 


* One can obtain these values of rp from (85) by a suitable limiting process. It 
is helpful to start with the alternative form of (85): 


1 ‘ . 
Tp = To {cos po + {f/s) — 1B sin pa + ro’s kig sin pé. 
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' Fig. 9 — Ray paths for system of Fig. 1, L = —/2f. 


lenses will cause reductions in the focal length of the p = 1 converging 
lens to make the ray sent on to the p = 2 lens diverge even more; for 
increases in the focal length of the p = 1 lens, the divergent lens reduces 
the angle of the ray sent on to the p = 2 lens. Hence, it is plausible that 
the addition of the divergent lens between the convergent lenses does 
not alter the stability requirement on the focal lengths. 

Given the mathematically-derived condition that divergent lenses of 





Fig. 10 — Ray paths for system of Fig. 3, L = V/2f. 
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RAY PATH FOR S=4f 


Fig. 11 — Ray path for s = 4f in all-convergent lens system. 


focal length f added to a chain of convergent lenses of focal length f do 
not change the stability criterion as described above, the physical argu- 
ment just outlined leads to the conclusion that weaker divergent lenses 
would also leave the stability criterion unchanged.* 














Fig. 12 — Ray path for f near s/4 to illustrate effect of divergent lens. 


VIII. CONCLUSION 


Alternating-gradient focusing is surprisingly close to an all-convergent 
lens system in focusing ability, and may be preferred if practical matters 
such as structural features or cost favor the alternating-gradient system. 

* When reading this manuscript, Mr. J. P. Gordon commented that this con- 
clusion is in agreement with the work of Boyd and Kogelnik*® which can be shown 


to yield the relation f2 > f: — L/2 for the required focal length f2 of the diverg- 
ing lens in terms of the focal length f; of the converging lens and the spacing L. 
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APPENDIX A 


We examine here the maximum value that the term 7ro’s ki sin pé of 
(35) can take as a function of lens number p when our objective is to 
minimize the term through appropriate choice of focal length. In the 
body of the article it has been shown that 1) has a minimum at s = 2f. 
This corresponds to a value of 6 = 7/2 from (39) and it is evident that 
sin p6 = sin p 7/2 is either zero or unity for all integral values of p. 

In the more general case we want to know the value of 


kyo sin pé = sin pé/sin 6. (54) 


When it is recognized that p may take on all integral values greater than 
zero it follows that the maximum value of [(sin pé)/sin 6] as p varies 
can never be less than unity for any fixed 6. 

It is possible for [(sin pé)/sin 6] to have a maximum value which is 
smaller than ky) = 1/sin 6. That is to say, sin 76 does not necessarily go 
through unity even though p ranges from 0 to © in integral steps. 

Referring to Fig. 18, the maximum value of sin pé will be less than 
unity if 


dq +3) = 1/2 (55) 
or 

6 = r/(2q + 1) (56) 
where q = 1, 2, 8, --- . It also is true that sin pé will have a maximum 
value less than unity for 

6 = r[x/(2q + 1)] (57) 


where r = 1, 2,3,4,---. 

The values of s/f corresponding to these values of 6 and the resultant 
values of maximum ky sin pé6 are given in Table I. Column 5 shows the 
ratio of kio to the maximum of kyo sin pé, and is a measure of the error 


6 
a — 27SIN pd 
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Fig. 13 — Diagram of sin pé yielding | sin pé | < 1 for all p. 
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TABLE I 

; ; : Max. Value of Column 3 

é s/f ho eo sings Column 4 
w/2 2 1.0 1.0 1.0 
r/3 1 1.15 1.0 1.15 
Qxr/3 3 1.15 1.0 1.15 
r/5 0.38 1.70 1.62 1.05 
24/5 1.38 1.05 1.0 1.05 
3/5 2.62 1.05 1.0 1.05 
4n/5 3.62 1.70 1.62 1.05 
w/7 0.194 2.31 2.255 1.023 
Qa/7 0.750 1.28 1.25 1.023 
32/7 1.554 1.0238 1.0 1.023 
An/7 2.444 1.023 1.0 1.023 
52/7 3.226 1.28 1.25 1.023 
62/7 3.806 2.31 2.255 1.023 





made in assuming sin pé goes through unity. That ratio is 1/cos (6/2) 
where 6 is given by (56). All values for a given g in (57) result in the same 
error, but the various values of r indicate the values of 6 and s/f at which 
that error will appear. lig. 14 summarizes the data of Table I; for p 
ranging up to infinity it is only at the discrete values of 6 given by (57) 


that the maximum of ky sin pé differs from ky . 


2.6 


o 











MAXIMUM VALUE OF Kig SIN Pd 


Fig. 14 — Maximum value of ki sin pé vs s/f. 
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If p were finite and the ratio s/f was varied, the plot of Fig. 14 would 
presumably show finite-width dips of the same over-all depth as those 
plotted. 


APPENDIX B 


It is the purpose of this appendix to point out that the “thin lens’ de- 
scription of light ray propagation leads to the following conclusion (see 
Fig. 15): 

The slope and displacement of the output ray of an arbitrary sequence 
of lenses for an input ray of slope ro’ and displacement 7; is exactly the 
algebraic sum of the slopes and displacements of the output ray found 
(z) for an input ray of slope ro’ with zero displacement from the axis, 
and assuming all lens displacements d, = 0, (77) for an input ray of dis- 
placement 7; with zero slope and assuming all lens displacements d, = 0, 
and (777) for an input ray of zero slope and zero displacement and as- 
suming one lens displacement at a time is nonzero, summing the ray 
output slopes and displacements thus found over all lens displacements. 

The proof is as follows: For the nth lens in a sequence of lenses (see 


Tig. 1): 
tn = Tra — (= " ‘e) (58) 


where r,, is the slope of the ray immediately following the nth lens and 
rn is the displacement at the nth lens. We note that the angular lens 
rotation ¢, does not affect the ray propagation, an approximation which 
implies that 





(7x — dn) COS Yn = (Tn — an) (59) 
or 
On <1. 
LENS NO.1 NO. 2 NO.3 NO.4 
(FOCAL LENGTH =f,) fa f3 4 
SLOPE=I 
\ 
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Fig. 15 — Light-ray path in an arbitrary lens system. 
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We may expand (58) to form 


, ’ (ry = d;) (rz — dz) (Fr, == dn) 
= Bg it ph ee Nees AN EE Nr 60 
r, To hi fe f, ( ) 
Similarly, the displacement r, at the nth lens is 
lm = Tra + Sn n— (61) 
which may be expanded to 
Tr = Toa + Sn {ni ee WG) 
fi 
(62) 
= (re 4 de) Ee Sickie (ra a a) 
fo fra 
which is valid for n = 2. 
We may examine each term of (60) and (62) and find that 
ry! = Agro’ + Bari + 2, om fa (63) 
m=1 m 
; man din 
th = Cato + Dari + 2 Bm (64) 


in which A,, B,, C,, D, and the a» and 8» are all independent of 
ro, 7, and the d», . 
We have thus proven the above-stated conclusion. 
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Analysis of a Tubular Gas Lens 


By D. MARCUSE and S. E. MILLER 
(Manuscript received June 8, 1964) 


If a cool gas is blown into a hot tube, at acts as a positive lens which will 
focus a light beam passing through the tube. 

Using a theory presented in Ref. 3, we give curves which show the temper- 
ature distribution in the tube as a function of the distance from the tube axis 
and also as a function of the distance along the axis. 

The focusing power of the lens is described by the difference in phase angle 
between a ray on the tube axis minus a ray at arbitrary distances from this 
axis and also as the second derivative of the phase angle on the axis of the 
tube. The phase curves, as a function of distance r from the tube axis, follow 
very closely an r? dependence. Expressions are given for the focal length of 
the lens. 

The power consumption of the lens ts discussed, and a figure of merit is 
defined as focusing power per watt. The gas used for this lens should be se- 
lected such that (n — 1)/k is as large as possible (where n is the index of 
refraction, k the heat conductivity of the gas). 

Using CO2 and a }-inch ID tube 5 inches long heated 20°C above the in- 
coming gas, a focal length of 5 feet with a power consumption of 0.325 watt 
is calculated; the focal length is inversely proportional to power consumption 
within certain limits. 


I. INTRODUCTION 


A communications system using light as the carrier of intelligence 
needs an efficient medium to propagate light from transmitter to re- 
ceiver. Among the several alternatives, the idea of Goubau and Schwer- 
ing’ of confining and propagating an electromagnetic wave with a system 
of lenses appears promising. However, in a lens-waveguide system there 
is a wide range of possibilities as to what types of lenses to use. Conven- 
tional glass lenses present problems, since they may not only absorb 
light in the glass medium itself, but furthermore present important re- 
flection losses which can be only partially avoided by special techniques 
such as coating the lens surfaces or making use of the Brewster angle. 
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Even if such corrective measures are used, there is still residual reflection 
and scattering of light due to unavoidable surface irregularities. 

It appears that most of the problems connected with glass lenses could 
be overcome if, instead of a high-index medium such as glass, a very low- 
index focusing medium were used. If the transition from air into a dense 
medium could be avoided, the problem of light reflection would not exist. 
Gases present themselves as an obvious choice of a low-index dielectric 
medium. Their dielectric constant can be influenced by changing their 
density. A change of density is most easily effected by varying the gas 
temperature. 

D. W. Berreman? built a successful gas lens by maintaining a tempera- 
ture gradient between a hot helix and a cold cylindrical enclosure. Al- 
ternatively, D. W. Berreman and S. E. Miller proposed a gas lens formed 
by blowing a cool gas into a hot tube (Fig. 1). Since the gas heats up first 
at the wall of the tube and remains cool longer at its center, it has a 
density distribution of higher-density gas in the center of the tube and 
decreasing density towards the wall. Since an increase in density is ac- 
companied by an increase in dielectric constant, it is easy to understand 
that the cool gas flowing through the hot tube acts as a positive lens and 
tends to focus a light beam traveling along the axis of the tube. 

We present in this article some theoretical results of the temperature 
distribution in the gas and the difference in phase angle between two 
light beams, one traveling along the tube axis and the other traveling 
closer to the wall of the tube. This phase difference is a measure of the 
focusing power of the lens. For an economical lens we want maximum 
phase shift with a minimum of thermal power. We present curves show- 
ing the power consumption of the lens as well as the ratio of phase differ- 
ence to power consumption. These data allow the construction of an op- 
timum lens. Different gases give different lens properties. A gas is most 
efficient if the ratio (n — 1)/k is large, where n is the index of refraction 
of the gas and k is its heat conductivity. 


Il. TEMPERATURE DISTRIBUTION 


The theory of temperature distribution in a cool gas which is blown into 
a hot tube of constant temperature is presented in Ref. 3. 


WARM TUBE 
COOL GAS (TEMPERATURE, Tw) WARM GAS 
U 


—> 
Z=0 








— 


Z 


Fig. 1 — Gas lens using forced flow in a tube. 
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It is assumed that the gas flow is laminar and has the radial velocity 
distribution of a viscous fluid 


v(r) = voll — (r/a)’] (1) 
where 


r = distance from tube axis 
radius of tube 
gas velocity at r = 0. 


I 


Vo 
The temperature T of the gas is given as 
= T, —T 


where 7’,, is the temperature of the wall of the tube. It is normalized with 
respect to 


= Ty, — To 


with 7) being the temperature of the cool gas before it enters the hot 
tube. 6/69 is expanded in terms of functions R,(r/a), which are shown in 
Fig. 2 for n = 0, 1 and 2. Values of R,(r/a) are listed in Table I. The 
temperature depends on the distance z measured from the beginning of 
the hot tube, the gas velocity vp , and the following material parameters 


k = heat conductivity measured in cal/cem sec deg) 
(deg = degrees Kelvin) 
gas density in gram/em°® 
specific heat at constant pressure in cal/gram. 


p 
Cp 


ll 


All these parameters depend somewhat on the temperature but are con- 
sidered constant in the derivation of the theory. They enter the equations 
in the combination 


a = k/avopep . (2) 


Taste |— FR Functions or Fic. 2 











x Ro(x) F(x) Ri(x) R2(x) 

0 1 1 1 1 

0.1 0.9819 0.9805 0.8923 0.753 
0.2 0.9290 0.9261 0.6067 0.206 
0.3 0.8456 0.8432 0.2367 —0.290 
0.4 0.7382 0.7382 —0.1062 —0.407 
0.5 0.6147 0.6175 —0.3399 —0.204 
0.6 0.4833 0.4880 —0.4317 0.104 
0.7 0.3506 0.3535 —0.3985 0.278 
0.8 0.2244 0.2244 —0.3051 0.278 
0.9 0.1069 0.1041 —0.1637 0.144 
1.0 0 0 0 0 
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Fig. 2— Functions Ro , Ri and Re vs r/a. 


The first three terms of the infinite series describing the temperature dis- 

tribution in the tube are 

s = 1.477 exp (- 7.3160 ‘) Ro (") — 0.810 exp (—44.360 2 
0 


a 


-Ry (") + 0.385 exp (—1066 ‘) R, (:) SP wai 
a a a 


(3) 


ANALYSIS OF A TUBULAR GAS LENS 1763 


The approximation is fairly poor at z = 0. However, the exponential 
factors in the higher terms of the series drop off very rapidly as z in- 
creases so that the approximation is already very good for values of 


a(z/a) > 0.01. 


Fig. 3 shows 6/4) at r = 0 as a function of o(z/a). It is apparent that for 
a(z/a) > 0.05, the distribution of 6/6) drops off exponentially. Fig. 4 
shows the r/a dependence of 6/6 for different values of o(2/a). 

To make Figs. 3 and 4 more meaningful, we list in Table IT the material 
parameters for several gases at 20°C and a pressure of 760 mm Hg. 


III. POWER CONSUMPTION 


The principle of operation of our gas flow lens requires that we heat 
the cool gas inside the hot tube. Even if we neglect all power losses to the 
environment, we have to spend a certain amount of heat power to oper- 
ate our lens. For a subsequent study of lens efficiency we need to know 
this basic power consumption. At any given length z of the tube we ob- 
tain the power absorbed by the gas as 


Poy = : [T(r,z) — Tolpcpy(r)2ar dr 


a 2 i, 
= Qmpeprebe | r E _ (‘) | E _ or) | dr. 
0 a Oo 


(4) 























Fig. 3— Normalized gas temperature on tube axis vs normalized distance 
along tube. 
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FIRST THREE ) 
TERMS OF SERIES 











r/a 
Fig. 4— Normalized gas temperature vs radial position with longitudinal posi- 


tion as a parameter. 


In order to be able to perform the integration easily, we restrict our- 
selves to values of 


a(z/a) > 0.05 
which allows us to express 0/60 by the first term of (3). In addition, we 
replace Ro(a) by 
R(x) & F(x) = 1 — 2.06 a + 1.06 2’. (5) 
This approximation deviates no more than 2.5 percent from the actual 


value of Ro(x). The values of F(x) can be compared to those of Ro(x) 
in Table I. The integration can now be performed easily, and we obtain: 


pS % a pcgtte| 1 — 0.820 exp (—7.2166 “)|. (6) 
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Taste II—Gas PARAMETERS vs TEMPERATURE 











k c a (2 — 1)/k 
Gas | calfem see | Ggram/emsy | (cal/eram | 'Cemysec)” | 1 ‘| (em sec deg/ 
CO2z 3.93 10-5 | 1.84 1073 0.199 0.107 | 4.20 10-4 10.7 
NH; 5.90 1075 | 0.72 1073 0.523 0.157 | 3.48 10-4 5.9 
CH, 7.80 10-* | 0.67 1073 0.528 0.220 | 4.13 1074 5.3 
Air 6.28 1075 | 1.21 107% 0.240 0.216 | 2.73 10-4 4.35 
He 41.0 107° | 0.084 1073 3.39 1.44 1.23 10-4 0.30 
He 35.0 1075 | 0.166 1073 1.25 1.69 0.34 10-4 0.097 





Using p and c, from Table II, P is in calories/sec. Fig. 5(a) shows the 
power consumption as a function of normalized lens length, oz/a. An 
alternative form of (6) brings out the dependence of P on the flow 
velocity vo more clearly: 


P = key 2 E — 0.820 exp (—za16 aie (7) 
2 V Uo 
The quantity 
V = kz/a’pe, = o(2/a)v0 (8) 


has the dimension of velocity and is characteristic of the gas and the tube 
geometry. Fig. 5(b) shows the power consumption as a function of nor- 
malized gas velocity, uo/V. 

The ratio of V/vo can be related to the time to = z/v) which it takes the 
gas particles on the axis to traverse the tube of length z with the velocity 
Vp and to a time 7 which is defined by 


dT (0,t) 
Dn dt (9) 
tr fx — T(Ot)* 


7 is characteristic of the heat diffusion rate in a gas which rests in a tube 
whose wall temperature is 7, . At t = 0 the gas has the uniform temper- 
ature Ty) < T', . Its temperature at a given radius r and time ¢ is T(r,t), 
so that 7'(0,t) is the gas temperature at the tube axis at time ¢. 1/7 is 
the time rate of temperature rise on the axis per degree of temperature 
difference between wall and axis. 

It is shown in Appendix A that 


V/v0 = o2z/a _ 0.173(to/r). (10) 


This shows that V/vp expresses the ratio of the time it takes the gas par- 
ticles (on the tube axis) to flow through the tube of length z to the heat 
diffusion rate on the tube axis. Equation (10) may be used to replace 
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Fig. 5— (a) Normalized power flow to the gas vs longitudinal position. (b) 
Normalized power flow to the gas vs normalized gas velocity. 


V/vo in (7) and in the other places where V/v or oz/a appears, resulting 
(for example) in 


exp (-7a16 2) = exp (-7a16 r) = exp (-1.265 “). 
a Vo T 


The parameters lens length z and gas velocity vp are important variables 
for other reasons, and the first two forms of the exponential may be pre- 
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ferred. We may note, however, 
= 0.173(a’pcp/k) 


and typical values are 0.161 second and 0.08 second for CO: and air, re- 
spectively, when a = 0.125 inch. It is surprising that this time constant 
is so short. ; 

Another physical meaning one can give V is that it represents that 
velocity of gas flow along the pipe axis which assures that 6/6. drops from 
its initial value of one at the beginning of the tube to 


6/8) = 9.910 * + 10° 


on the axis at its end. 


IV. FOCUSING ACTION 


A lens focuses because the optical path length varies for rays traveling 
at different distances from its axis. 

We describe the focusing action of our lens by the phase angle of a ray 
traveling parallel to the axis of the structure. The phase angle is given by 


(v2) = Bo [ es (11) 


Here, 85 = 27/Xo is the free-space propagation constant of the light beam, 
and n is the index of refraction of the gas. 


" — as To 
nrc) = 1+ (m — 1) Tra) (12) 
Poi4% (1 a Ey Ty 60)” (13) 
To 0 


The last step is an approximation for 60/7'9 «< 1. The temperature in 
(12) has to be expressed in degrees Kelvin. 
We decompose ®(7,z) into two parts: 


®(7,z) = e + v(7,2). (14) 
The first part 


e= ml 1+ (nm — 1) G-#)]. (15) 


is independent of the position r of the ray in the gas lens, while the sec- 
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ond part 


= 4) 9 [7 (rx) 
b= Bolno Nef wae 


6 V 
= Boz(m — 1) my {o.202R () E — exp (—7a16 ry 
r V 
— 0.0183R, (.) E — exp (-443 r)) 
+ 0.00363R, (*) E — exp (—106 r)) Ae Gog } 
a Vo 


accounts for the different amounts of phase shift in different parts of the 
lens. 

The difference between the phase angle of a ray traveling along the 
lens axis and the phase angle of a ray traveling at a distance r from the 
axis is 


(16) 


A® = Boz(m — 1) mi {0200(1 — Ry) E = exp (- 7.316 ry 
— 0.0183(1 — R:) E — exp (-443 ry (17) 
0 


+ 0.00363(1 — Rez) E — exp (—106 ry + a 
0 


This form of A® shows clearly its dependence on flow velocity for a fixed 
tube length z. To study the dependence of A¢ for fixed flow rate and vary- 
ing length, the following form is preferable. 


A® = Bo(a/c ) (No = 1) (60/T) {0.202(1 = Ro) 
-(1 — exp(—7.3160z/a)) — 0.0183(1 — R,) (1 — exp(—44.302/a)) (18) 
+ 0.00363(1 — Re) (1 — exp(—106cz/a)) + ---}. 


Fig. 6 shows a plot of A® versus length of lens for r/a = 0.4. That 
means that we compare the phase difference between a ray on the axis 
and another at distance r = 0.4a from the axis of the lens. 

Fig. 7 shows the phase difference at a fixed length z as a function of 
gas velocity. In this case, A® goes through a maximum which for 
r/a = 0.4 occurs at %/V = 6.9. The position of this maximum depends 
somewhat on the radius r of the ray used for phase comparison with the 
axial ray. Appendix B gives the theory and Table III gives the values of 


ANALYSIS OF A TUBULAR GAS LENS 1769 


0.032 








0.028 








0.024 








0.020 


0.016 








0.012 

















_ Fig. 6 — Phase difference between ray at r = 0 and ray at r = 0.4a vs longitu- 
dinal position. 


the position of the maximum w/V for different values of r/a. Table III 
shows that the position of the maximum does not change much with the 
radial position of the reference ray. 

We can explain physically this maximum in A® versus w% as follows. At 
very low gas velocity the majority of the gas in the tube is at the same 
temperature — the temperature of the walls, 7, . It is heated up in a time 
r [see (9)] after entering. With little temperature difference between the 
gas at r = 0 and at r > 0 there is little Ad’. As vp increases, the gas on 
the axis remains at or near 7’) , but that nearer the walls is heated because 
it flows more slowly [see (1)] and larger A® develops. Beyond some ve- 
locity, further increases in velocity cause the gas at r = 0.4a (for ex- 
ample) to leave the tube at lower and lower temperatures — i.e., less 
temperature difference will exist between r = 0 and r = 0.4a because the 
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Vig. 7 — Phase difference between ray at r = 0 and ray at r = 0.4a vs gas velocity. 


transit time of the gas through the tube becomes less than the thermal 
diffusion time constant 7. Thus, at high velocities the A® decreases. Note 
that this explanation (and the theory) depends upon laminar flow of the 
gas. If there is radial mixing of the gas, less A® would be expected than 
predicted above, and the maximum in A® versus vp might not occur. 

The second derivative d’@/d(r/a)’ is a good measure of the effective- 
ness of the lens for light rays close to its axis. For a glass lens 


b = nBod(r) 


where d(7) is the thickness of the lens as a function of the distance from 

its axis. The radius of curvature FR of the glass lens is given by 
1_ide 

R NBo dr? 

In order to be able to take the second derivative of 6 we have to express 

the functions Ry, Ri, and Re by power series with respect to r/a. For 


TasBLe III — Maximum w /V For VALUES OF r/a 


r/a 0.2 0.4 0.6 
8.2 


Vo/V 6.73 6.9 6 
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the second derivative on the axis at r = 0, it is sufficient to know the 
coefficient of (r/a)” in the expansion. Jakob® gives a series expansion of 
the R-functions. 


R, =1— 16,7(r/a)’ + --- (19) 


with Bo = 2.705, 6: = 6.66, and B, = 10.3. 
We get these values 


ey = Boz (No > dL) Oo Vo bo, 738 1 — exp (~za16 FI 
0 
a r=0 
— 0.405 E — exp (—443 ry (20) 
0 


+ 0.192 E — exp (—106 4) + vt 
0 


The maximum of this curve as a function of u/V appears at vo/V = 6.75. 

Fig. 8(a) is a plot of a normalized value of d’®/d(r/a)’ as a function 
of w/V, while Fig. 8(b) shows it (with a different normalization) as a 
function of o(z/a). 

The r/a dependence of A® is shown in Fig. 9. 

In order to show what values the phase difference might actually as- 
sume, and also to compare different gases, we have plotted A® in Fig. 
10 for several gases and the following geometry and flow rate: 


2a = 0.25 inch 

Vo 212 cm/sec, corresponding to 2 liters/minute or 4.77 miles/hr. 
Bo = 1.07 X 10° cm”, corresponding to \>x = 58904 

Tw = 3438°K 

Ty = 293°K 


and with the values of m» — 1 and o as listed in Table IJ. These curves 
assume a tube length z so long that no further A® would be realized with 
a longer z (i.e., outgoing gas at uniform temperature). 

It is interesting to compare the r/a dependence of A® to the simple 
function c(r/a)*. For this purpose we use (17), which is written so that 
A® = Oatr = O. Fig. 11 gives the normalized value of A® as a function 
of r/a for several values of the gas velocity vo/V. For comparison the 
function c(r/a)’ is shown by dotted lines. The constant c is adjusted so 
that both curves coincide at r/a = 0.4. The actual curves of, A® are 
surprisingly close to the simple square law dependence in all cases. If the 


1772 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1964 


2.0 


(a) 



































co 





alo 
(No-1)B0a 4 














Q 
D|N 


Fig. 8 — (a) Normalized d?é/d(r/a)? vs gas velocity; (b) normalized d*b/d(r/a)? 
vs longitudinal position. 


gas lens could be treated as a thin lens, it would act very similar to a 
glass lens with spherically curved surfaces. 

However, the gas lens is not thin and the question presents itself: 
how do different sections of the lens contribute to the over-all focusing 
effect? Fig. 12 shows the phase difference A® between a ray on the axis 
at r = O and aray at r fora fixed value »/V = 6.9 for different sections 
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Fig. 9 — Normalized A® vs r/a with longitudinal position as a parameter. 


of the lens. The curve showing A¢ for the section 0 — z has areaey been 
shown in Fig. 11. The other curves show A® in the first } of the lens 
[curve 0 — (4)z], the second 4[curve (4)z — (2)z], and the last 4[curve 
(})z — z]. The contributions are surprisingly different at different radii 
and do not resemble simple (r/a)” dependences. However, they all add 
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Fig. 10 — A® vs r/a when a = 0.125 inch, vp = 212 cm/sec (corresponding to 
2 liters/min), \o = 5890 A, Tw = 343°K, To = 293°K and ¢Z/a > 1. 


up to the 0 — z curve which does resemble the (r/a)” dependence very 
closely, as was shown in Fig. 11. 


V. FIGURE OF MERIT 


The focusing action of the gas lens becomes independent of the length 
of the lens if «(z/a) > 1, as Figs. 6 and 8(b) show. We also know that, 
for a fixed length of the lens, there is an optimum flow velocity, as shown 
by Figs. 7 and 8. 

For practical applications one would like not only to obtain an effective 
lens but also to do so with a minimum expenditure of power. It is, there- 
fore, interesting to study the lens action, that is, d’@/d(r/a)”, per unit of 
applied power. We may introduce the ratio ° 


_1f d@ 
fe P Gy) ae 
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Fig. 11 — Normalized A® vs r/a with gas velocity as a a dotted curve 
represents variation as (r/a)?, normalized to ordinate at r/a = 0.4 


as the figure of merit of the lens. From (6) and (20) we obtain 





280 mo — 1 f F 
Ma oe a 
Be 
aT) I r — 0.820 exp (—7.316 r) 22a) 
0 
with 
V 
= 0.738 E — exp (-7.216 Y] — 0.405 
Vo 
(22b) 


=| exp | = 44.3 V + 0.192 | 1 — exp 2406) , 
Vo Vo 


1776 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1964 


1.0 
4.8 





0.6 
0.5 
0.4 











0.3 











0.2 



































0.01 | 
0.006 
0.005 
0.004 


























Fig. 12 — A® contributed by first third, second third, and output third of gas 
lens for vo/V = 6.9. 


For a given value of »/V, the figure of merit is proportional to 
(mo — 1)/k. It is advantageous to make this number as large as possible. 
The figure of merit /, given in (22b), is plotted in Fig. 13. 
The gases in Table II are arranged in decreasing order of (mo — 1)/k. 
Of all the gases listed in that table, carbon dioxide is best suited for a gas 
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Fig. 13 — Figure of merit given by (22a) and focusing power (as in Fig. 8) vs 
gas velocity. 


lens. This does not mean, however, that gases with larger values of 
(no — 1)/k cannot be found. 


VI. FOCAL LENGTH 


We have seen that A® varies nearly as (7/a)’. In the region where the 
lens is weak, we may treat it as a thin lens and obtain directly a simple 
expression for focal length. 

For any thin lens it may be shown that the focal length f is given by 


f = 3Bo(7°/A®) (23) 


where 


A® is the phase shift added on axis as compared to that for a ray at 
radius r 


80 = phase constant of the region surrounding the lens. 
We obtain A® from (17), which is given by the following for the gas 
velocity set to maximize A® at (r/a) = 0.4 —i.e., at (w/V) = 6.9: 


Ab = 0.839(r/a)"(O0/T)Bo2(nm0 — 1). (24) 
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Putting (24) into (23) we obtain the following expression for the focal 
length of a weak gas lens: 
ea 

f = 0.596 OGD: 
If a = 0.125 inch, z = 5 inches, 7) = 293°K, and 6) = 20°C, we find 
f & 5 feet using CO, as the gas and f & 8 feet using air as the gas; the 
power transferred to the gas with CO. would be 0.0775 cal/see = 0.325 
watt. 

When the gas lens is not weak, one should take into account that the 
refractive index varies both with radial position and with longitudinal 
position. Work is under way to analyze this very difficult situation. A 
simpler approach, and one which should give a first-order answer for 
gas lenses operated near the velocity producing maximum A¢® [see (17)], 
is to assume a medium within the lens 


(25) 


nrc) = nfl — lags’) (26) 
where 
x = distance (within Iens )from start of lens 
r = radius 
Na = index of refraction on the axis. 


Tor the gas velocity (u/V) = 6.9 it may be shown that 

a, = — = (m — 1). (27) 

ad 
In other unpublished work the authors have shown that the radial posi- 
tion of a ray (or of the axis of a Gaussian beam mode) is 
_— 7, 7 os 

r = 7; cos Wage + —= sin Va (28) 
V ae 
where 


r; = displacement of ray at lens input 


r;/ = slope of ray at lens input. 


We can use this general result to specify the focal length of a strong (or 
weak ) gas lens with reference to Tig. 14. All input rays with zero slope 
will converge to a point on the axis a distance d beyond the output face 
of the lens (Fig. 14), where 


r= Toot (wan (29) 
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LENS 


d = Jag COT ( [82 t) 





ee ee 
{a2 Sin (Yaa t) 


Fig. 14 — Diagram defining focal length and position of the equivalent thin 
lens for a thick gas lens. 


¢t is the length of the lens, and n, has been taken as unity. An equivalent 
thin lens may be located a distance f back from the focal point, where 


1 
I> Zazsin (Vand) oe 


This expression for focal length is valid up to (+/aet) = 2/2, at which 
point d = 0. For (+/ast) > 2/2 the rays cross within the lens, per (28). 
For (+/az;t) « 1 it may be shown that (30) passes into (25) and the 
location of the equivalent thin lens is in the center of the distributed lens. 

EK. A. J. Marcatilit has solved Maxwell’s equations for a medium 
characterized by (26) and has found the normal modes for a sequence of 
lenses composed of segments of such a medium. This work relates closely 
to a sequence of gas lenses described in this paper. Experiments with 
tubular thermal gas lenses are reported by A. C. Beck.® 


VII. CONCLUSION 


When a cool gas is blown through a warmer tube, the gas at the axis 
has a lower temperature than that near the walls. Thus the density and 
refractive index is larger at the axis and a converging lens is formed. If 
the tube were at a lower temperature than the input gas, a diverging lens 
would be formed. 

There is an optimum gas velocity for maximizing the focusing power 
of such lenses, and expressions are given for this velocity. It turns out that 
the optimum transit time for gas through the tube is approximately the 
time constant for temperature changes in a gas at rest in the tube, which 
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for typical gases (air and carbon dioxide) is about 0.1 second in a }-inch 
ID tube. Although not discussed in this paper, it is found that a 4-inch 
tube 6 inches long yields (at the optimum velocity for focusing power) a 
Reynolds number well below that at which turbulence is expected. 

Expressions are given for focal length and a figure of merit expressed 
as focusing power per watt of power transferred to the moving gas. 

The best gas is one with a maximum (n — 1)/k, where n is the refrac- 
tive index and k is the heat conductivity. 


APPENDIX A 


Derivation of (10) 
The relation (10) 
V/vp = 0.173 (to/7) (31) 


can be derived as follows. 
The time development of a cool gas resting in a tube of wall tempera- 
ture 7’, can be described as follows 


a) Jo (w, ‘) ‘ 
Pat iT) oe (32) 
n=1 Wnd 1(Wn) 


with 
An = (k/a’pcp)w,’ and Jo(w,) = 0. 


At t = 0, (32) becomes 7'(r,0) = 7, which is constant throughout the 
tube’s cross section. 

As time progresses the exponents ),t become large, so that very soon 
the first term of the series is the only contributing factor. Neglecting all 
the terms except the first, we get 


aT (0,t) 
ak eee ee eee (33) 
a Oe (0 aaa ar 


and substituting z = voto we get (31). Since we neglected all but the first 
term in the series, (33) represents the asymptotic value which 1/7 as- 
sumes after the initial transients have died down. 

To obtain a feeling for the accuracy of the approximation involved in 
deriving (33), we write down the ratio of the second to the first term in 
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the sum (32) forr = 0: 


wi 1(w1) 


t 
ines = t} = 0. —496 et 
TF Caagy OXPL= (Xe — a) = 0.666 exp ( ‘) 


This ratio is 10 for ¢/r = 0.986 and is 10° for ¢/r = 0.45. The approxi- 
mation is excellent for times £ = 7 and is quite good for ¢ > 0.5r. 
For the special example used in Fig. 10 we get for 


CO. : 7 = 0.161 sec 
alr: 7 = 0.08 sec. 


APPENDIX B 


The Maximum of (17) 


We seek an expression for the value of v/V which brings A®, equation 
(17), to a maximum: 


d(A®) 
d(v/V) 


| 020211 — Ro) {1 — exp (-7316 *) (1 + 7.316 e x 
— 0.0183(1 — Ri) rr — exp (-s43 ry ( + 44.3 y * 


+ 0.00363(1 — Re) ‘ — exp (106 7) (1 + 106 Nl. 


We set (34) equal to zero, and noting that the second and third exponen- 
tials are small, we neglect them (to be justified by the solutions thus ob- 
tained) yielding 


exp (—z.16 r) (1 + 7.316 Y) 
Vo Vo 


= 1 — 0.0906 


2 a Gi 1) r 


(35) 
G1 — RB) 4 goyg SL = Re) Rs) 


(1 — Ro) (1 = Ro)’ 


Equation (35) gives the approximate value of vo/V at the maximum of 
A@ for any chosen radius, r/a. Additional terms in (34) can be taken if 
more accuracy is desired, which was done in computing the w/V for 
r/a = 0.2, as given in the body of the paper. 
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Hollow Metallic and Dielectric Wave- 
euides for Long Distance Optical 
Transmission and Lasers 


By E. A. J. MARCATILI and R. A. SCHMELTZER 
(Manuscript received June 12, 1964) 


The field configurations and propagation constants of the normal modes 
are determined for a hollow circular waveguide made of dtelectric material 
or metal for application as an optical waveguide. The increase of attenua- 
tion due to curvature of the axis 1s also determined. 

The attenuation of each mode is found to be proportional to the square 
of the free-space wavelength \ and inversely proportional to the cube of the 
cylinder radius a. For a hollow dielectric waveguide made of glass with 
vy = 1.50, = 1p, anda = 1 mm, an attenuation of 1.85 db/km is predicted 
for the minimum-loss mode, EHy,. This loss is doubled for a radius of 
curvature of the guide axis R ~ 10 km. Hence, dielectric materials do not 
seem suttable for use in hollow circular waveguides for long distance optical 
transmission because of the high loss introduced by even mild curvature 
of the guide axis. Nevertheless, dielectric materials are shown to be very 
attractive as guiding media for gaseous amplifiers and oscillators, not only 
because of the low attenuation but also because the gain per unit length of 
a dielectric tube containing He-Ne ‘“‘masing” mixture at the right pressure 
can be considerably enhanced by reducing the tube diameter. In this applica- 
tion, a small guide radius is desirable, thereby making the curvature of 
the guide axis not critical. For) = 0.6328y and optimum radiusa = 0.058 
mm, a maximum theoretical gain of 7.6 db/m is predicted. 

It ts shown that the hollow metallic circular waveguide is far less sensitive 
to curvature of the guide axis. This 1s due to the comparatively large complex 
dielectric constant exhibited by metals at optical frequencies. For a wave- 
length X = 1p and a radius a = 0.25 mm, the attenuation for the minimum 
loss TEo, mode in an aluminum waveguide is only 1.8 db/km. This loss 
ts doubled for a radius of curvature as short as R ~ 48 meters. For) = 3yu 
and a = 0.6 mm, the attenuation of the TE, mode is also 1.8 db/km. The 
radius of curvature which doubles this loss is approximately 75 meters. The 
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straight guide loss for the EH, mode for) = 1p anda = 0.25 mm ts 57 
db/km and is increased to 320 db/km for } = 3u anda = 0.6 mm. 

In view of the low-loss characteristic of the To. mode in metallic wave- 
guides, the high-loss discrimination of noncircular electric modes, and the 
relative insensitivity to axis curvature, the hollow metallic circular wave- 
guide appears to be very attractive as a transmission medium for long 
distance optical communication. 


I. INTRODUCTION 


During recent years the potentially large frequency range made avail- 
able to communications by the development of the optical maser has 
stimulated much interest in efficient methods for long distance trans- 
mission of light. The most promising contenders for long distance optical 
transmission media consist of sequences of lenses or mirrors, highly 
reflective hollow metallic pipes, and dielectric waveguides.!~!° 

In this paper we present an analysis of the field configurations and 
propagation constants of the normal modes in a hollow circular wave- 
guide which, because of its simplicity and low loss, may become an 
important competitor. The guiding structure considered here may con- 
sist of an ordinary metallic pipe of precision bore whose inner surface 
is highly reflective, or of a hollow dielectric pipe —i.e., one in which 
the metal is replaced with dielectric. Although the transmission charac- 
teristics of metallic waveguides are well known for microwave fre- 
quencies, this theory is invalidated for operation at optical wavelengths, 
because the metal no longer acts as a good conductor but rather as a 
dielectric having a large dielectric constant. In the subsequent analysis, 
therefore, both the dielectric and metallic guide are considered as special 
cases of a general hollow circular waveguide having an external medium 
made of arbitrary isotropic material whose optical properties are charac- 
terized by a finite complex refractive index. If the free-space wavelength 
is much smaller than the internal radius of the tube, the energy propa- 
gates not in the external medium but essentially within the tube, bounc- 
ing at grazing angles against the wall. Consequently, there is little energy 
loss due to refraction. The refracted field is partially reflected by the 
external surface of the tube and may, in general, interfere constructively 
or destructively with the field inside the tube, decreasing or increasing 
the attenuation. Because of the difficulty of controlling the interference 
paths, it seems more convenient to eliminate the effect completely by 
introducing sufficient loss in the dielectric or, in the case of a glass di- 
electric, by frosting the external surface. The field in the hole of the 
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tube is then unaffected by wall thickness. We shall therefore simplify 
the analysis of the hollow circular waveguide by assuming infinite wall 
thickness, as depicted in Fig. 1. 

This structure will be shown to be attractive as a low-loss transmission 
medium for long distance optical communication as well as for optical 
gaseous amplifiers and oscillators. It is known, for example, that in a 
tube containing a He-Ne mixture such that the product of radius and 
pressure is roughly a constant, the gain per unit length is inversely 
proportional to the radius of the tube." On the other hand, we find in 
this paper that the attenuation of the normal modes is inversely propor- 
tional to the cube of the radius. Hence there is an optimum tube radius 
for which the net gain per unit length is a maximum. Furthermore, 
because the guidance is continuous, there is no need for periodic focusing. 
Consequently, no restriction need be imposed on the length of the 
amplifying or oscillating tube. 

We begin by analyzing an idealized guide having a straight axis and 
a cylindrical wall. The results are then extended to include the effects 
of mild curvature of the guide axis by finding a perturbation correction 
for field configurations and propagation constants of the idealized 
straight guide. 


II, MODAL ANALYSIS OF THE GENERAL STRAIGHT CIRCULAR WAVEGUIDE 


Consider a waveguide consisting of a circular cylinder of radius a and 
free-space dielectric constant «9 embedded in another medium of dielec- 
tric or metal having a complex dielectric constant «. The magnetic 
permeability yo is assumed to be that of free space for both media. We 
are interested in finding the field components of the normal modes of 
the waveguide and in determining the complex propagation constants 
of these modes. 

The problem is substantially simplified if it is assumed that 


ka = 22a/X > |v | tam (1) 





€0) [kg ————>- Z 





Fig. 1 — Hollow dielectric waveguide. 
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and 
| (v/k) -1] «1 (1) 


where k = wr/equy = 27/d is the free-space propagation constant; Unm 
is the mth root of the equation J,_1(Uam) = 0, and n and m are integers 
that characterize the propagating mode; v = +/e/e is the complex 
refractive index of the external medium; and y is the axial propagation 
constant of the mode under consideration. The first inequality states 
that the radius a is much larger than the free-space wavelength \. In 
the case of metalization of the external medium, | v | may be quite large 
but is finite at optical frequencies. The second inequality restricts our 
analysis to low-loss modes, which are those whose propagation constants 
y are nearly equal to that of free space. 

The field components of the natural modes of the most general circular 
cylindrical structure with arbitrary isotropic internal and external media 
have been determined by Stratton.” This structure supports three types 
of modes: first, transverse circular electric modes whose only field com- 
ponents are Hy, H, and H, ; second, transverse circular magnetic modes 
whose components are Ho, H, and E,; and third, hybrid modes with all 
the electric and magnetic components present. The approximate field 
components of these modes are written below. They have been derived 
using the inequalities (1) and neglecting terms with powers of \/a larger 
than one. The superscripts 7 and e refer to the internal and external 
media, respectively. 


1. Cireular electric modes TEo, (n = 0) 


Eom’ = Ji(kir) ) 
Hyom’ So Ve Ji(kir) exp t(yz =— wt ) 
Ho 
Ham = —1 (/* Fa Jo(k) | | 
Mo ka (2) 
Eom = —] ) 
e /& . Uom 
Hyom = N/ ik “kar? — 1) Jo (uo) 


Hon = —iV¥—1 (/2 neyee ane eenees 


Ho 


2. Circular magnetic modes TMom (n = 0) 


fae = Ji(kir) 


Eom’ = i om Joker) 


Hom = a Ia(h) 
Ho 


exp i(yz — wt) 








: (3) 
Eyom rs a 
é 7 F Sica : 
f 2 2 Pa Soa — = 
Eom = VV a Wae= po [Iker —a) + ye — wt] 
Pee ee es €0 
60 uo | J 
3. Hybrid modes EH, m (n ¥ 0) 
. 2 
Eonm = | Jt (kr) a ets Vr — 1 140) || 
2nka 
-cos n(0 + 4) | 
Emm = = (kr) + x= V/r— 1 a(t) | exp i(yz—wt) 
-sin n(6 + 6) 
Einn = —1 = Jrlkir) sin n(@ + 6) | 
Honm = wa Ernm 
Ho 
Aint = -4/2 Bown’ 
Ko (4) 


Him = — (/ © Binm etn n(0 + 6) 
Mo 





Eenm = cos n(0 + 4) 
Eyam = sin n(0 + 0) i Wao s(t) 
Emn = —VP? —1 sin n(6 + 6))-exp ike(r — a) + ye — ail] 





e€ 2 €9 e 

Honm = /2 Eyam 
Mo 

e €0 e 

Lin = >= {/* Eonm 
Ho 


Ham = — {/* Einm ctn n(8 + 6) 
Mo 
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| re | atl = 3; (7.0165) as 
a a | 4 


Pr 
uP (3.0324) F 
a 
TEo TEgs TMo, TMoe 


(a) CIRCULAR ELECTRIC MODES 


J.4(7.5885) Nir ¥-a{t10883) we 
a 


EH_3, . EH_32 
(C) HYBRID MODES 





Fig. 2 — Electric field lines of modes in hollow dielectric waveguides: (a) circu- 
lar electric modes, (b) circular magnetic modes, (c) hybrid modes. 
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where the complex propagation constant y satisfies the relationships 
ki? = kh -_ y 
2 272 Ps (5) 
Ke = vho ay 
and Un» is the mth root of the equation 
Dal beak) = 0. (6) 


As usual, | 7 | is the number of periods of each field component in the @ 
direction, and m is both the order of the root of (6) and the number of 
maxima and minima of each component counted in the radial direction 
within the internal medium. The constant @) appearing in (4) will be- 
come of interest later on when we study the waveguide with curved 
axis, because it will admit any orientation of the transverse electric 
field relative to the plane of curvature of the guide axis. 

For n = 0, the modes are either transverse electric TEom (2), or 
transverse magnetic TMo, (3). The lines of electric field of the TEon 
modes are transverse concentric circles centered on the z axis. The lines 
of magnetic field are in planes containing the z axis. Similarly, the lines 
of magnetic field of the TMo, modes are transverse concentric circles 
centered on the z axis with the electric field contained in radial planes. 
The electric field lines of the modes TE , TEo2 , TMo: and TMo: are 
shown in Figs. 2(a) and 2(b); each vector represents qualitatively the 
intensity and direction of the local field. 

For n # 0, the modes are hybrid, EH,» (4); therefore, the magnetic 
and electric field are three-dimensional with relatively small axial field 
components in the internal medium. Thus the hybrid modes are almost 
transverse. 

Let us examine the projection of these three-dimensional field lines 
on planes perpendicular to the axis z of the waveguide. The differential 
equations for the projected lines of electric field in both media are 





Ldr _ Emm! 
rdée = -Henm'* 
(7) 
Lar _ Enon 
r dé Honm? : 


Exum’ as Well as Einm’ contain two terms as given in (4). Both are neces- 
sary to satisfy the boundary conditions. If we neglect the second term, 
however, no substantial error is introduced except very close (a few 
wavelengths) to the boundary, where the second term dominates as 
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the first tends to zero. With this simplification, the differential equations 
(7) in both media become identical 


(1/r)(dr/dé) = tan né. 
Upon integrating, one obtains an equation for the locus of the projected 
electric field lines 
(r/ro)" cos nO = 1 (8) 


where 7) is a constant of integration that individualizes the member 
of the family of lines. The electric field of an EH,» mode is different 
from that of EH_,» mode. 

The projection of the magnetic field lines is determined in a similar 
way. These equations are 


(r/ro)” sin n6 = 1 (9) 
for the internal medium and 
(r/ro)"” sin no = 1 


for the external medium. 

The projections of the internal electric (8) and magnetic (9) field lines 
are identical for any given mode except for a rotation of 7/(2n) radians 
around the z axis. In Fig. 2(c) the lines of the electric field in the internal 
medium are depicted for the first few hybrid modes. Again the vectors 
represent qualitatively the field intensities and directions. 

What happens at the boundary? Consider, for example, the projected 
electric lines of mode EH, , as shown in Fig. 3(a). These field lines 
satisfy (8), an equation which is valid everywhere except near the 
boundary. The boundary conditions are violated in I‘ig. 3(a) because 
there is continuity not only of the tangential electric component but 
also of the normal component. The internal normal component must be 
v’ times larger than the external one. Consequently, the electric field 
line must be discontinuous. This result is shown qualitatively in Fig. 
3(b). 

A three-dimensional representation of the field lines is far more com- 
plicated than the two-dimensional one depicted in Fig. 2. As a typical 
example, the electric field lines of the EHe. mode are shown in Fig. 4 in 
a three-dimensional perspective. 

The propagation constants of the TEom, TMom and EHnm (n # 0) 
modes are determined below (21). It is found that the hybrid mode 
EH_|, ),~ is degenerate (same propagation constant) with the 
EH | » | 42, ; 1.e., for every hybrid mode with negative azimuthal index 
there is a degenerate hybrid mode with positive aximuthal index. The 
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(a) (b) 


Fig. 3 — (a) Electric field lines of EH: mode violating boundary conditions; 
(b) same EH, mode with electric field lines qualitatively corrected. 


transverse modes TE», and TMo, and the hybrid modes IH,,, and 
EH» have no degenerate counterpart. 

If the field components of the degenerate EH_ | nj »m and EH | » | 42, 
modes (4) are added, we obtain new composite modes whose electric 
and magnetic field lines project as straight lines on a plane perpendicular 
to the z axis. Some of those composite modes are shown in Fig. 5. 

It should be noted that if the refractive index of the external medium, 





_ Fia. 4 — Cutaway view of electric field lines of EH22 mode. The axial period 
is grossly exaggerated. 
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J2(5.313§) Je(2.4175) 


OI 
O19 


EH.y, +EH3 EH.y2 + EH32 





olr3 
ors 


J-3 (6.385) J-3(9.761 r) ; 
| { | i> 


EH-a; t EHay EH-22 + EHaa 


Fig. 5 — Electric field lines of composite modes EH jn),m + TH jnys2,m - 


v, is very close to unity, then for each value of m, the TEom , TMom and 
EH:,, modes also become degenerate (17), (21) and the sum of the 
components of TEom (2) and EHe, (4) yields a new composite mode, 
as shown in Fig. 6. This mode, together with those in Fig. 5 and the 
EH,» of Fig. 2(¢), form a complete set that closely resembles the set 
found for interferometers with plane circular mirrors or for sequences 
of circular irises.’ 

Let us now consider the field intensity distribution outside and inside 
the hollow dielectric waveguide. The external field (2), (3) and (4) 
has the radial dependence 


exp [tk.(r — a)] 


Vr 
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4, (2.0165) 





Fig. 6 — Electric field lines of composite modes TEon + EHam . 


Irom (5) and (20) we obtain, neglecting terms of order vay and 
higher, k, = k~/v? — 1. The redial dependence is then 


exp [ik 7? — 1(r — a)| 
Vr 


If the dielectric is lossy, the refractive index »v has a positive imaginary 
part. The external electric and magnetic fields then oscillate with period 
of the order of \/| +/+? — 1]| and decay exponentially in the radial 
direction. The maximum field intensities in the external medium occur 
at the boundary r = a. Being proportional to \/a, these maxima are 
small. 

The field intensity inside the hollow waveguide is more interesting. 
Again if we substitute y (20) into (2), (3) and (4) and neglect terms 
of the order \/a, only the internal transverse components remain. 

For TE o, modes 


Eon’ = — 4/8 Fn = J; (von ‘) ‘ (10) 
€o a 


For TMo» modes, 


Bia = /% Hon! = J; (an ‘) . (11) 
€9 a 
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Tor EH,» modes, 


Eonm = = -4/# Hinm = a 4 —1 (eam ") cos né 
: * Lr : 
Evin = /# Hoa” >= Jn-1 (am *) sin né. 
0 


The field components of each mode have approximately the same radial 
dependence, varying as Bessel functions of the first kind, and tending 
to negligibly small values at the boundary (6). This approximate radial 
dependence (10), (11) and (12) is reproduced under each mode pattern 
in Figs. 2(a), 2(b) and 2(c). 


(12) 


III, PROPAGATION CONSTANTS FOR THE GENERAL CIRCULAR CYLINDRICAL 
GUIDE 


In this section we shall determine the propagation constants y, of 
the Tom , TMon and EH,,,, modes in the straight hollow guide at optical 
wavelengths. The propagation constants are the roots of the following 
characteristic equation for the general circular cylindrical structure.” 
They are related to k; and k, by expressions (5). 

[Jed ki ae |= (kia) 7k | 


J ,(k a) ke Ha (ka) 


rele 
kk;a ke : 


This equation is simplified substantially when the approximations in 
(1) are introduced. Since k,a > 1, the asymptotic value of the Hankel 
functions may be used 


J, (kia) hk, H, (ka) a3) 





Y" (ka) 
fee wit O(1/ka), ka> 1. (14) 
Since 
y y IN ) e 
a ae i (3 <1 (15) 


powers of »’/k.a larger than one shall be neglected. The characteristic 
equation then simplifies to 


Inika) = ty,(ki/k) J a(kia) (16) 
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where 


ye 
Vr —1 


yp? 


Vy = PEA for TMy,, modes (n = 0) (17) 
3(7? + 1) 
VP — 1 


To solve the characteristic equation for k,a we notice that because 
of (1) and (5), the right-hand side of (16) is close to zero. Using a 
perturbation technique and keeping only the first term of the perturba- 
tion, 





for TEom modes (n = 0) 








for EHym modes (n # 0). 


kia Unn(1 — tv,/ka) (18) 

where Unm as before is the mth root of dhe equation 
Jn—1(Unm) = 0. (19) 
The validity of (18) is assured provided that the order of the mode is 


low enough so that | », | Unum « ka. The propagation constants y can 
then be obtained from (5) 


Lie ivnd 
mkl1—= - , 
revel 3 (3) (1-33) 0 
The phase constant and attenuation constant of each mode are the real 
and imaginary parts of y, respectively, 











sanmeo fsfeeT fo m(2)] 
(21) 
ia Gy (a2) Spe Boe 


IV. PROPAGATION CONSTANTS FOR STRAIGHT DIELECTRIC GUIDES 


For guides made of dielectric material, », is usually real and inde- 
pendent of \, so that the phase and attenuation constants arc 
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Qn 1 (ttnmd\ 
Pa aa 3 (ey } 


1 
Vesa , for Tom modes (n 





I 


0) 


2\2 2 (22) 
Onn = (¢=") WN SRr7 for TMom modes (n = 0) 


2r ]/ a 


2(" + 1) 
2." 7 
mae for HH,m modes (n ¥ 0) 


The phase constant of modes in hollow diclectric waveguides have the 
same frequency dependence as modes in perfectly conducting metallic 
waveguides when operating far from cutoff; both transmission media 
are then similarly dispersive. 

The attenuation constants are proportional to \”/a®. Consequently, 
the losses can be made arbitrarily small by choosing the radius of the 
tube a sufficiently large relative to the wavelength i. 

The refractive index »v affects the attenuation of each of the three 
types of modes (22) in different ways. This fact is reasonable on physical 
grounds. TE, modes can be considered to be composed of plane wave- 
lets, each impinging at grazing angle on the interface between the two 
media with polarization perpendicular to the plane of incidence. It is 
known from the laws of refraction that the larger the value of v, the 
smaller the refracted power. 

TMom modes may also be thought of as consisting of plane wavelets, 
but with the electric field of each now contained in the plane of incidence. 
Tor v very close to unity, there is little reflection and the refracted loss 
is high; as the value of v is allowed to become large, each wavelet gets 
close to the Brewster angle of incidence and again the refracted loss is 
high. The minimum occurs for » = +/2. 

EH,m modes are composed of both types of plane wavelets. Therefore, 
as is reasonable from the above argument, the attenuation constant 
Qnm has a v dependence which is an average of those of TEom and TMom 
modes. The value of » that minimizes anm is v = W/3 = 1.73. 

The attenuation constants (22) are proportional to unm. Some values 
Of Unm (19) are presented in Table I. For a fixed value of n the attenua- 
tion constant increases with m. This statement is not true for m fixed 
and n variable. 

Comparing the attenuation constants (22) of the different modes, 
we find that the mode with lowest attenuation is TEo: if » > 2.02 and 
EH, if » < 2.02. Most glasses have a refractive index v ® 1.5, and 
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TABLE I— SomE VALUES OF Unn 











n/m 1 2 3 4 
1 2.405 5.52 8.654 11.796 
2 or 0 3.832 7.016 10.178 13.324 
3 or —1l 5.136 8.417 11.62 14.796 
4 or —2 6.380 9.761 13.015 16.223 





consequently for hollow glass tube EH, should be preferred. The attenu- 
ation of this mode (8686a;; in db/km) has been plotted in Fig. 7 as a 
function of \/a for » = 1.50 using \ as a parameter. Typically, for a 
wavelength \ = ly and radius a = 1 mm, the attenuation of the EHy 
mode is 1.85 db/km (23 db/mile). If the radius of the guide is doubled, 
the attenuation is reduced to 0.231 db/km. 
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V. HOLLOW DIELECTRIC WAVEGUIDE FOR OPTICAL MASER AMPLIFIERS 
AND OSCILLATORS 


A mode traveling in a hollow dielectric waveguide filled with ‘‘masing”’ 
material experiences a net gain which is given by the difference between 
the amplification due to the active medium and the loss due to leakage 
through the walls. It has been shown" that in a tube filled with the 
right mixture of He and Ne at the proper pressure, the gain G is in- 
versely proportional to the radius a of the tube. Then 


. G = (A/a) db/m (23) 
where the radius a is measured in meters and the constant A is 
A = 0.00066 db. 


On the other hand, we have found that the transmission loss of the 
EH, mode in the hollow waveguide with a refractive index v = 1.50 
is L = 8.686ay, . From (22) 


L = B()’/a*) db/m (24) 
where the constant B is 
B = 1.85 db. 
The net gain per unit length is then 
G — L = (A/a) — B()’/a’) (25) 


passing through a maximum at the value of the radius for which 
a(G — L)/da = 0. 


The optimum radius and the maximum net gain are respectively 


Qopt = VV 3X = 91.7r 


2 Atl 10° a 
a L max — zi qe ee \ : 
Grd) 31 Bi 481 5~ db/m 
For the He-Ne mixture, \ = 0.6328 10° m. Consequently 
Apt = 0.058 mm 
(27) 


(G — L) mex = 7.6 db/m. 


Although the diameter of the tube is quite small, the gain per unit length 
is sufficiently large as to make hollow dielectric amplifiers and oscillators 
attractive for experimentation. 


LONG DISTANCE OPTICAL COMMUNICATION 1799 


Present-day confocal He-Ne masers employ tubes whose approximate 
length and radius are 1 m and 3 mm respectively. The gain per passage 
(23) is 0.22 db (25 per cent). If a hollow dielectric waveguide with an 
optimum radius 0.058 mm were used, the same gain would be achieved 
with a length of only 0.22/7.6 = 29 mm. This presents an excellent 
possibility for a very compact maser. 

Even for radii larger than the optimum, the hollow dielectric wave- 
guide is still attractive. or example with a = 0.25 mm, the gain is 2.6 
db/m, a value far larger than the gain 0.22 db/m obtained for the 3-mm 
radius tube commonly used for masers. 

Nevertheless, for long-wavelength masers the optimum values (26) 
are not practical. Consider for example a tube containing an active 
material which amplifies at )\ = 10°“ m. Let us assume that the constant 
A is still 0.00066. Then from (26), the optimum radius and maximum 
gain are 


Gopt = 9.14 mm 


(28) 
(G — DL) max = 0.0481 db/m. 


The gain is very small. It could be enhanced by reducing the radius and 
by increasing the refractive index » of the walls to a value much larger 
than 1.5. This can be accomplished if metal is used instead of diclectric, 
as is shown in the next section. 


VI. ATTENUATION CONSTANTS FOR THE STRAIGHT METALLIC GUIDE 


In order to discuss the attenuation characteristics of metallic wave- 
guides, we shall need to have some quantitative information about the 
behavior of metals at optical frequencies. We examine as a typical 
example the optical properties of aluminum, even though this may not 
be the most suitable metal. The dispersion characteristics of the con- 
ductivity and relative dielectric constants of aluminum have been 
studied extensively by Hodgson,” Beattie and Conn,“ and Schulz.” 
The data used below have been taken from a compilation of the results 
of these studies,’® and is presented graphically in Fig. 8. It is evident 
from these dispersion curves that the dielectric constant for aluminum 
is much larger than for ordinary dielectrics and increases monotonically 
with wavelength in the range 0.84 < X < 4.0un. 

The circular electric modes have the lowest loss in metallic wave- 
guides, while the circular magnetic and hybrid modes are rapidly 
attenuated even for a wavelength as short as 0.3. The attenuation con- 
stant ao for the lowest-loss TEo: mode is plotted in Fig. 9 for wave- 
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WAVELENGTH, A, IN MICRONS 


Fig. 8 — Dispersion curve for aluminum v? = e/ep = Re(v?) + 7 Im(v?) versus 
wavelength A(u). 


lengths in the range 0.8u < \ < 4.0u for a = 0.25 mm, 0.50 mm and 
1 mm. These data show a considerable improvement over that cor- 
responding to the lowest-loss mode EH, for the dielectric guide. We 
saw that for a hollow glass dielectric waveguide, the EH; mode has a 
loss of 1.8 db/km for a radius a = 1 mm and wavelength \ = ip. The 
attenuation for the TE: mode for the aluminum guide with the same 
radius and wavelength is only 0.028 db/km. For a wavelength \ = ly 
and a radius a = 0.25 mm, the minimum-loss TE; mode for the alumi- 
num waveguide has an attenuation constant aj, = 1.8 db/km. The 
same attenuation is achieved for \ = 3u anda = 0.6 mm. The attenua- 
tion constant for the TEo2 mode under the last two conditions is ao. = 
6.05 db/km. For a wavelength X = lp and a = 0.25 mm, the straight 
guide losses for the TMo; and EH; modes are approximately 145 db/km 
and 57 db/km, respectively. 


VII. FIELD CONFIGURATION AND ATTENUATION OF MODES IN THE CURVED 
GUIDE 


In order to achieve a more realistic evaluation of the hollow circular 
waveguide for long distance optical transmission, it is necessary to 
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WAVELENGTH, A, IN MICRONS 


Ee. 9 — Attenuation of TEo: mode, ao: versus wavelength A(z), for aluminum 
guide. 


evaluate the effects of mild curvature of the guide axis. This is most 
easily accomplished by determining a perturbation correction for both 
the field configuration and the attenuation constants for the idealized 
straight guide whose characteristics have been described above. 
VIII. FORMULATION OF THE PROBLEM 
Consider the toroidal system (r, 6, z) with metric coefficients 
ep = 1 
é& =r (29) 
ee = 1+ r/R sin 0 


as depicted in Fig. 10. In this system of coordinates, a differential 
length is given by 


ds = (e,dr’ + ede” + e7dz’)? (30) 


where # is the radius of curvature of the toroidal system and is chosen 
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Fig. 10 — The curved hollow dielectric waveguide and the associated toroidal 
coordinate system (r, 6, Z). 


equal to the radius of curvature of the guide axis, so that the guide wall 
is located at r = a, and the axis of the guide coincides with the curved 
z-axis. In this toroidal coordinate system, Maxwell’s equations are 


5 {(1 + r/R sin 6)35C,} — dyerBle + iwer(1 + r/R sin 60)& = 0 
ty SC, — = {(1 + r/R sin 6)3C,} + iwe(1 + r/R sin 0)& = 0 


~ (r35Cy) — =, i, + iwer& = 0 


(31) 
5 {(1 + r/R sin 0)8.} — 7y-& — iwur(1 + r/R sin 0)%, = 0 
1y&r — < {(1 + r/R sin 6)&.} — iwu(1 + r/R sin 6)5Cp = 0 


6] 0 ; 
ar (r&) — "0 & — twurse, = 0 


where we have omitted the common factor 
exp 1(y.2 — wt) 


in which ¥, is the propagation constant along the curved z-axis. 

The toroidal system (r, 6,2) and the curved waveguide degenerate 
into a cylindrical system and a straight guide, respectively, as # ap- 
proaches infinity. Maxwell’s equations for the straight guide are there- 
fore obtained from (31) by letting R > ~. 
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) ; ‘ 
36 H, — iyrHe + iwerH, = 0 
7 0 : 
iyH, — 3p He + iweli, = 0 


re) 0 : 
oe (rHe) — ag Ul + iwerH, = 0 
(32) 
0 : ; 
ap EH, — tyrEy — iwerH, = 0 


i =: Se cute =O 
or 


0 0 : 
or (rE4) = ap Le = twurH, = 0 


where vy is the propagation constant for the straight guide, and the 
superscript 7 and subscripts nm are suppressed. 


IX. SOLUTION FOR THE CURVED GUIDE 


We proceed to solve (31) for the field vectors &, 5C and obtain the 
propagation constant y. for the curved guide as functions of the field 
vectors £, H and the propagation constant y of the straight guide. The 
latter quantities are known [(2), (3), (4) and (20)]. We introduce a 


parameter 
k a 2ra\" a a 
C= y Ego (28) R° (33) 


The range of interest is that for which the radius of curvature R is so 
large that o < 1. 
Using a first-order perturbation technique, the solution of (81) is 





& = (1 + or/asin 6) 
& = (1 + or/asin 6)E, 
& = (1 + or/asin 0)E, + (tc/ka)(E, sin 6 + Eo cos @) (34) 
Ke = (1 + or/asin 0)H¢ 


a, = (1 + or/a sin 0)H, 
a, = (1 + or/asin 6)H, 4+ (to/ka)(H, sin 6 + Ho cos @). 


The effect of curvature of the guide axis is to make unsymmetrical the 
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transverse field configuration of the straight guide. Each transverse 
component is enhanced in the half cross section farthest from the center 
of curvature. 

To a first-order perturbation of o, the propagation constants of the 
curved and straight guide are identical; 1.e., y, & y. Nevertheless, know- 
ing the field components of the mildly curved structure, it is possible 
to calculate its attenuation constants anm(R) = Rey, . 


X. ATTENUATION CONSTANTS Qnm(I) 


The mean radial power flowing into the dielectric per unit length 
at the surface of the guide is 


220 
P, = al Re [&3C,* — &,5Ce*] [1 + a/R sin Ola dé. (35) 
a v0 


r=a, 


The power flow in the axial z direction within the internal medium 
r<ais 


a Qn 
pick ; / | Re [€,5¢p* — &50,*]r do dr (36) 
0 0 


and decreases along z at a rate equal to the radial flow per unit length 
Praies 


dP, 
dz 


where a@nm(R) is the attenuation constant of the mode under considera- 
tion for the curved hollow dielectric waveguide. Consequently 


Onm (Ee) ae. aE if fads (38) 
To compute P, we substitute the known field quantities into (35). 
This yields 


€ tamed n?(tam) f°* 
P, = Re 4/2 Se any : |1+osiné| (1+ a/R sin 6) 
: = 





= —2anm(R) Pz = ae (37) 


[ 1 for TEom modes (39) 
y dé for TMon modes 
v sin? n(@ + 4) + cos? n(6 + 6) for EH,» modes. 


Terms with powers of \/(27a) larger than two have been neglected. 
Upon integrating, 


P _ Re 4/2 im a Unm) 
nr St uo kar — I 
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(1 + 40°) for TEom modes 
y(1 + 4o’) for TMom modes 
(40) 
oe ee 6,(+1) ” — 1 
Leen frege(it Fo ay 
for EH,,,, modes 
“COs 20) 
where 
fl, n= +1 (41) 


SSG. qe 


The power P, flowing radially in the guide is obtained by substituting 
(34) into (36) and integrating 


P, = = £8 42 (am) {1 + 2 [1 + 2n(n — 2) fn’ (42) 


Hence 


nm (BR) = eam( 0) f cle ; (22) (2) 


ae (43) 
= 2 
—_ n(n — 2) zk S(t) Re V7 — 1 cos 28 
; Mae 4 R rtd 
: Vr — I } 
where Qnm( ©) = Qnam is the attenuation constant for modes in the 


straight guide (R = o«) given by (21). The attenuation constant 
Qnm(F) can also be written in the following form 








Onm(R) = aam( ©) + (a°/NR’) ReVnm(v) (44) 
where 
1 
VSP —1l 
4 v Qn \’ 
Vam(v) = o a/v = 1 (=) 


M+) a 








1806 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1964 


The values of ReVam(v) are always positive. Some of them have been 
calculated in Table II for a refractive index » = 1.50. 

The attenuation constant of any mode consists of two terms (44). 
The first coincides with that of the straight guide and is proportional 
tO Unm \’/a’; the second term represents an increase in attenuation due 
to curvature of the guide axis and is proportional to a°/\’R’unm . There- 
fore the lower the straight guide attenuation constant (small unm d’/a’), 
the larger the loss due to bends and vice versa. From (43) or (45) we 
find that only for the EH4:,, modes, the orientation of the field with 
respect to the plane of curvature influences the attenuation. If 4 = 0, 
the electric field in the center of the guide is in the plane of curvature 
and the attenuation is a maximum. For 6) = -+7/2, the electric field 
is normal to the plane of curvature and the attenuation is a minimum. 
The ratio of maximum to minimum is mild, however. For the lowest 
attenuation mode EH, and » = 1.50, it is 


Vw (60 = 0) 


——__-_— = 1.65. 4 
If | v | > 1, that ratio is 
Vinm (80 = 0) = 
1.0240)" 4.6. 


From equation (43) we find that the radius of curvature which doubles 
the straight guide attenuation is 


_ 2 (2nVa 
MW 7g a ee 
fo es) 
U 4 


nm 


Re /? — 1 + (47) 
— foaeL. COs 29 


I —S SS 
Vr — 1 


This value of Ry is only approximate since (43) was derived by assum- 
ingo <1. 


XI. EFFECT OF CURVATURE ON ATTENUATION OF MODES IN THE HOLLOW 
DIELECTRIC WAVEGUIDE 


Yor a straight hollow glass waveguide with » = 1.5 and a radius 
a = 1 mm operating typically at a wavelength \ = 1p, the attenuation 
of the lowest-loss mode EHy is an = 1.85 db/km. This loss is doubled 
for a radius of curvature Ry) ~ 10 km. For long distance optical trans- 
mission a radius of curvature of at least a few hundred meters would 
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Tasie II — Some VALuEs OF Vp,,(v) 


n/m 1 2 3 4 


—1 | 2.57 (1 + 0.326] 1.034 (1 + 0.301! 0.553 (1 + 0.295 [0.347 (1 + 0.293 














cos 26) cos 26) Cos 249) cos 269) 

TE 3.22 0.955 0.455 0.265 

: TM 7.22 2.145 1.022 0.596 
1 15.5 (1 + 0.246} 2.60 (1 + 0.279] 1.034 (1 + 0.284 |0.554 (1 + 0.286 

cos 26) COS 260) Cos 26) cos 260) 

2 5.22 1.55 0.735 0.482 

3 2.57 1.034 0.553 0.347 

4 or —2 1.51 0.737 0.4380 0.287 





be tolerable. Therefore hollow dielectric waveguides do not seem suita- 
ble for long distance optical transmission. . 

On the other hand, the curvature in hollow dielectric waveguides for 
application in gaseous amplifiers and oscillators is not critical. For 
example, if a = 0.25 mm and \ = ly, the straight guide attenuation is 
0.12 db/meter. The radius of curvature which doubles this quantity 
for the lossiest polarization —i.e., with the electric field at the center 
of the guide contained in the plane of curvature —is approximately 
150 meters, a value well within the limits of laboratory precision. Con- 
sequently, the hollow dielectric waveguide does remain very attractive 
as a guiding medium for optical amplifiers and oscillators where a small 
guide radius is desirable, thereby making the guide less sensitive to 
curvature of the axis. 


XII. EFFECT OF CURVATURE ON ATTENUATION OF MODES IN THE METALLIC 
GUIDE 


The attenuation constants ao,(f) for the lowest-loss TEom modes 
in the curved metallic guide are given by 


com(R) & aom() {1 ef. + (2) (z)} (48) 


where aom(*) is the attenuation constant for the TE», mode in the 
straight guide, R = o. For a radius a = 0.25 mm and wavelength 
d = Ip, the straight guide loss for the lowest-loss TE»: mode, an1( ©) = 
1.8 db/km, is doubled for a radius of curvature of only Ro 48 meters. 
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For \ = 3p and a = 0.6 mn, the straight TE; loss is also 1.8 db/km 
and the radius of curvature that doubles that loss is 75 m. 


XIII. CONCLUSIONS 


The hollow dielectric waveguide at optical wavelengths supports a 
complete set of normal modes that are either circular electric, circular 
magnetic or hybrid. They resemble the modes found in a sequence of 
circular irises not only in field configuration but also in loss discrimina- 
tion among them. For hollow metallic waveguides the mode discrimina- 
tion is far larger. 

The field configuration and propagation constants have been de- 
termined. The attenuation is practically independent of the loss tangent 
of the dielectric but depends essentially on the refraction mechanism 
at the wall. Assuming refractive index of the dielectric, 1.5 for hollow 
dielectric waveguides, the EH mode exhibits the lowest power attenua- 
tion, viz., 1.85 (d’/a*®) db/m. For a wavelength } = ly and a tube 
radius a = 1 mm, the attenuation is only 1.85 db/km. 

The hollow dielectric waveguide does not, however, seem suitable for 
long distance optical transmission because of the high loss introduced 
by even mild curvature of the guide axis. Nevertheless it remains very 
attractive as a guiding medium for optical amplifiers and oscillators, 
since here a small radius of the guide is desirable. Consequently, curva- 
ture of the guide axis is not critical. Filled with ‘“‘masing” material, the 
hollow dielectric waveguide provides not only guidance but also gain 
which is almost inversely proportional to the radius. For the right 
He-Ne mixture, the maximum theoretical gain attainable is 7.6 db/m 
provided that the radius is 0.058 mm. But even if the radius is 0.25 mm, 
the predicted gain is still large, viz. 2.6 db/m. 

The metallic waveguide is superior to the hollow dielectric waveguide 
for use in long distance optical transmission. Because of the relatively 
large dielectric constant exhibited by aluminum at optical frequencies, 
the attenuation constant for the lowest-loss mode TE; is comparatively 
small and less sensitive to curvature of the guide axis. For a radius 
a = 0.25 mm and a wavelength \ = ly, the attenuation constant for 
TE modes in the straight aluminum guide is only 1.8 db/km, which is 
doubled for a radius of curvature of about 48 meters. For a = 0.6 mm 
and \ = 3un, the TE straight guide loss is also 1.8 db/km but is doubled 
if the radius of curvature of the waveguide axis is 75m. 

We have considered some of the theoretical problems of the hollow 
dielectric or metallic waveguide. The results are promising. Neverthe- 
less, the usefulness of these guides has yet to be proven experimentally, 
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and furthermore the attenuation constants discussed here do not include 
scattering losses due to surface imperfections. 
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A Condition for the £..-Stability 
of Feedback Systems Containing a 
Single Time-Varying Nonlinear Element 


By I. W. SANDBERG 
(Manuscript received May 14, 1964) 


In the automatic control literature, a feedback system is frequently 
said to be stable if, regardless of the initial state of the system, each 
bounded input applied at ¢ = 0 produces a bounded output. The pur- 
pose of this brief is to present a sufficient condition for the feedback 
system of Fig. 1 to be stable in this sense. 

Our discussion is restricted to cases in which g,, f, vu, and v (in Fig. 1) 
denote real-valued measurable functions of ¢ defined for ¢ 2 0. The 
block labeled y is assumed to represent a memoryless time-varying 
element that introduces the constraint u(t) = y[f(¢),t], in which y(a,t) 
is a function of x and ¢ with the properties that y(0,¢) = 0 for t = 0 and 
there exist a positive constant 6 and a real constant @ such that 

= va, 


a < <p, ¢ 
v 


0 


IV 





for all real x ¥ 0. 
The block labeled K represents the linear time-invariant portion of 
the forward path, and is assumed to introduce the constraint 


v(t) = a k(t — r)u(r)dr — gp(t), t= 0 (1) 


in which k and g, are real-valued measurable functions such that 
[ lk@lat< ©, sup |gl| < @. 
0 t>0 


The function g. takes into account the initial conditions at t = 0. We 
do not require that wu and v be related by a differential equation (or by a 
system of differential equations). 
Assumption: We shall assume that the response v(t) is well defined 
and such that for all finite y > 0 


sup | v(t) | < , 
O<st<y 
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Fig. 1 — Nonlinear feedback system. 


for each initial-condition function ge that meets the conditions stated 
above and each input g, such that 


sup | g(t) | < @. 
t>0 


Definition: We shall say that the feedback system of Fig. 1 is ‘‘£,,-stable”’ 
if and only if there exists a positive constant p with the property that the 
response v satisfies 


sup | v(t) | S p sup | gi(t) + go(t) | + sup | go(Z) | 
t>0 t>0 t>0 
for every initial-condition function g, that meets the conditions stated 
above and every input g; such that 
sup | g(t) | < ©. 
t>0 
Clearly, if the system is £,,-stable, then the response is bounded 
whenever the input is bounded. 
Theorem: Suppose that 
(i) with K(s) = [ k(i)e“‘dt for Rels] = 0, 
0 


1+ 4(a+ 8)K(s) #0 for Rels] = 0, and 
(17) with h(-) the inverse Laplace transform of 


K(s) Fs 
1+ 3(a + B)K(s)’ 


122) [aw tae a 


IV 


Then the feedback system of Fig. 1 ts £,,-stable. 
It is of interest to note that condition (77) is satisfied whenever 
condition (2) is met, a > 0, h(t) = 0 fort = 0, and 


[ k(t) dt > 0, 


* Condition (7) and our assumption regarding an ) imply that A(-) exists, and 
that its modulus is integrable on [0,0 ) [see Ref. 1]. 
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The theorem can be proved with the techniques discussed in Refs. 2 
and 3. More specifically, consider the relation between f and (g: + gz): 


nd) +a) = 70 + | KAD Oais. £0: @) 


and suppose that 
sup | gi(t) + g(t) | < ~. 
t>0 


Arguments very similar to those used to prove Theorem 1 of Ref. 2 
and Theorem I of Ref. 3 show that if (a) f satisfies (2), (b) 


sup | f(t) | < 
O<st<y 


for all finite y > 0, and (c) conditions (7) and (77) of our theorem are 
satisfied, then there exists a positive constant p,: (which does not de- 
pend upon g; or gz) such that 


sup If) |S pm ey | gi(t) + go(t) |. 


Our theorem is a direct consequence of this fact, in view of (1) and the 
relation between u and f. 
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Thermal Gas Lens Measurements 


By A. C. BECK 
(Manuscript received June 12, 1964) 


The refractive index of a gas is an inverse function of its temperature 
under isobaric conditions. Therefore a cool gas flowing into a heated tube 
will have a lower refractive index near the heated walls than in the 
center, and the combination becomes a convex lens which will focus a 
light beam transmitted through the tube.!? 

A simple arrangement was built to get some information about the 
behavior of such a device. It is sketched in Fig. 1. The gas flowed through 
a 5-inch long electrically heated brass tube with a +3-inch inside diam- 
eter. The tube was mounted in a large polyfoam cylinder to reduce 
external heat losses. A single-mode light beam from a helium-neon gas 
laser oscillating at ) = 0.63 micron was collimated at a diameter of 
about 4 inch with glass lenses and then sent along the axis of the heated 
tube, through which a gas was flowing. The light was intercepted on a 
screen about 10 to 20 feet away. When this system, acting like a convex 
lens, is placed in such a collimated beam, the light goes through a focus 
at the lens focal length, and then expands to form a much larger area of 
light on the screen. A Foucault knife edge cutting the beam at the focus 
was used for measuring the focal length of the lens. 

The reciprocal of the lens focal length in meters (often called the 
focusing power or convergence of the lens, and usually expressed in 
diopters) is plotted as a function of wall temperature rise on Fig. 2. 
Measurements were made with air and with carbon dioxide at the 
indicated flow rates. Helium was found to give very small effects, as 
expected.? 

It will be noted from Fig. 2 that the lens power increases more slowly 
with flow rates at the higher values. At somewhat higher flow rates it 
flattens off and stops increasing, but at these very high flow rates, gas 
turbulence sets in and laminar flow is no longer present, so accurate 
measurements become impossible. At the flow rates shown on Fig. 2, 
constant and steady light patterns and focal lengths are observed when 
heated gas emerging from the tube is kept out of the light beam. This 
indicates that when laminar flow exists, the gas is very stable in a tube 
as small as this one; therefore steady optical lens effects are obtained. 
At higher temperatures the lens power continues to increase, at least 
up to the temperatures obtainable with this equipment. At the highest 
lens powers there is an effect that appears like spherical aberration, 
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Fig. 1 — Experimental arrangement for thermal gas lens. 


shown by a concentric pattern of light rings beginning to appear on the 
screen,?* but this is very slight in the ranges shown on Fig. 2. 

The brass tube was heated with direct current, and the input electri- 
cal power was measured for all points. The electrical power required 
with no gas flow was also measured as a function of temperature. The 
heat power taken away by the flowing gas was the difference between 
these two values at the measured temperature rise. From this informa- 
tion, a “figure of merit,’’ defined as the reciprocal of the focal length in 
meters divided by the watts of heat extracted by the flowing gas (diopters 
per watt), can be calculated. 

Representative data, and some comparisons with theoretical values, 
are summarized in Table I. The theory, notation, and gas constants 
are given by Marcuse and Miller.? 

Agreement between calculated and measured values is fair for heat 
power consumption of the flowing gas, but not as good for focusing 
power, particularly at large flow rates of the heavy gas. 

The focusing power and figure of merit of these lenses can be im- 
proved by using longer sections of tubing, since this one was not op- 
timized. They may also be improved by using other gases having a 
higher (n — 1)/k ratio, where n is the refractive index and k is the heat 
conductivity of the gas, and by operating at higher gas pressures. 

Thermal gas lenses have short focal lengths with low power consump- 
tions, and the effects are remarkably steady. The focusing power can 
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Fig. 2 — Lens focusing power versus wall temperature rise. 


be adjusted easily by changing the gas flow rate. Such devices may find 
applications to long distance optical communication systems. 

The assistance of W. E. Whitacre in constructing and setting up the 
equipment is gratefully acknowledged. 


TaBLE I] — REPRESENTATIVE DATA AND THEORETICAL VALUES 























1/F (diopters)|Power (watts)| ment (alo 
Flow Rate 1opters)|Power (watts)| meri lop- 
k t tt 

Gas es (enh/see) Guess vo/V ers/watt) 
calc. | meas. | calc. | meas. | calc. | meas. 
Air 1 27.24 105 3.87 |1.67| 1.38 |1.74] 1.8 |0.96]0.72 
Air 2 27.24 211 7.74 |1.95) 2.1 |2.75|] 2.9 |0.71/0.72 
CO, 1 13.52 105 7.80 |3.00| 3.8 |1.73) 2.0 | 1.73] 1.90 
CO, 2 13.52 211 15.6 2.02) 4.7 |2.49; 2.9 10.81) 1.62 

REFERENCES 


1. Berreman, D. W., A Lens or Light Guide Using Convectively Distorted Thermal 
Gradients in Gases, B.S.T.J., 48, July, 1964, Part 1, p. 1469. 
2. Marcuse, D., and Miller, S. E. , Analysis of a Tubular Gas Lens, B.S.T.J., this 


issue, p. 1759. 
3. Born, Max, and Wolf, Emil, Principles of Optics, Pergamon Press, 1959, pp. 472- 
477. 


4. Beck, A. C., Gas Mixture Lens Measurements, B.S.T.J., this issue, p. 1821. 


B.S.T.J. BRIEFS 1821 


Gas Mixture Lens Measurements 


By A. C. BECK 
(Manuscript received June 15, 1964) 


Different gases have different refractive indexes. If two different 
gases are introduced into different parts of a straight hollow tube and 
allowed to continue in laminar flow, they diffuse into each other slowly 
but remain sufficiently separated so that a refractive index gradient is 
maintained for some distance. It has been suggested by A. R. Hutson, G. 
K. Conklin, and the author that various optical components such 
as prisms and lenses can be obtained by using such gaseous structures. 
These components have very low losses and reflections, with no solid 
surfaces to cause matching or cleaning problems. For these reasons they 
may have important applications in optical communications systems.!:?.3 

A simple structure, sketched in Fig. 1, was built to test this principle 
and to get some information about the performance of a gas mixture 
lens. A cylindrical porous-walled tube with an inside diameter of + inch 
and an exposed length of 3 inches was mounted inside a larger phenolic 
cylinder with its ends blocked to form a gas reservoir. A single-mode light 
beam from a helium-neon gas laser oscillating at ) = 0.63 micron was col- 
limated at a diameter of about ¢ inch with glass lenses and then sent 
along the axis of the porous tube through which one gas was flowing. 
The second gas was introduced into the tube through the porous walls. 
The light beam was then intercepted on a screen about 10 to 20 feet 
away. If there were no optical refractive effects on this beam, a spot 
about $ inch in diameter was seen at these distances, since the beam 
was collimated. A diverging lens increases the spot size without the 
beam going through a focus between the lens and the screen. A con- 
verging lens causes the beam to become smaller, go through a focus, 
then expand, forming a much larger spot on the screen at distances of 
many focal lengths. When measuring converging lenses, a Foucault 
knife edge cutting the beam was used to determine the focal point and 
thus the focal length of the lens. 

With this system, diverging lenses are obtained if the gas of higher 
refractive index is introduced through the walls of the porous tube while 
the gas with the lower index flows down the tube parallel to the light 
beam. Conversely, sending the less refractive gas through the porous 
wall of the tube through which the more refractive gas and the light 
beam are passing gives a converging or positive lens. The reciprocal of 
the lens focal length in meters (often called the focusing power or con- 
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Fig. 1 — Experimental arrangement for gas mixture lens. 


vergence of the lens, and usually expressed in diopters) is plotted for 
various gas flow rates in Figs. 2 and 3. Three different gas mixtures are 
shown, although other gas mixtures might have even greater focusing 
power. With carbon dioxide-helium and air-helium, focusing power in- 
creased with increased flow rate in the range of Fig. 2, but Fig. 3 shows 
that this is not the case for the carbon dioxide-air mixture in the range 
shown there, since focusing power at first increased, reached a maximum, 
then decreased as the carbon dioxide flow rate was increased. For these 
curves, Dextilose paper, No. 17V, was rolled up with about five thick- 
nesses to form the porous tube. Other materials work equally well or 
better, porous ceramics, sintered metals, fritted glass and fine copper 
screen showing similar results. 

At the indicated flow rates, the patterns and focal lengths are constant 
and steady when the mixed gases emerging from the tube are kept out 
of the light beam. This shows that the gas mixture is quite stable when 
laminar flow exists in the tube. The tube continued beyond the porous 
region exposed to the second gas for about 13 inches, but greater focusing 
power than that shown here was obtained with longer exit tube lengths. 
This indicates that the gases were still in laminar flow and were not 
completely mixed in this short length, so that focusing continued with 
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Fig. 2 — Lens focusing power versus gas flow rates, air-helium and carbon 
dioxide-helium. 


more length. At higher flow rates, particularly of the gas first introduced 
into the system, turbulence appeared. This broke up the light beam 
and gave unsteady and flickering patterns on the screen. 

Some photographs of patterns observed with gas mixture arrange- 
ments are reproduced on Fig. 4. At the upper left, the image of the beam 
is shown with no gases flowing in the tube. The diameter of this spot 
on a screen at any distance was about § inch. At the upper right is the 
enlarged spot formed by carbon dioxide flowing down the tube and 
helium introduced through the porous tube walls at rather high flow 
rates so that a short focal length (approximately 6 inches) was obtained. 
This pattern, of course, increased in size as the screen was moved farther 
away. At lower flow rates, giving longer focal lengths, the light pattern 
is quite uniform, but as the focal length is made short, concentric rings 
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Fig. 3 — Lens focusing power versus gas flow rates, carbon dioxide-air. 


of light begin to appear, as can just be seen here. These rings are similar 
to those which appear when spherical aberration forms a light inter- 
ference pattern. With a 3-inch porous tube, this effect is slight until 
very short focal lengths are produced. However, when the porous tube 
was replaced by two solid brass tubes in line with a gap of about 7 inch 
between them at the center of the second gas reservoir, the light pattern 
shown in the photograph at the lower left was obtained. When the 
porous tube was replaced by a solid brass tube having six 75-inch diam- 
eter holes drilled symmetrically every 60 degrees around its circum- 
ference in a plane located at the center of the second gas reservoir, the 
pattern at the lower right was observed. The use of tubes with different 
numbers of equally spaced holes produced patterns similar to this, having 
an n-fold symmetry where n is the number of holes. The steadiness and 
stability of these patterns of light interference are graphic indications 
of the stability of the gas flow, and the stability of gas lenses. 

There are ways of separating the mixed gases, if desired, by permeable 
membranes or other means so that they can be mixed again in a later 
lens farther along a transmission system. Complete separation is not 
necessary to give lens action. 
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Gas mixture lenses can have short focal lengths with moderate gas 
flows. The effects are remarkably steady and constant. The focal length 
is easily changed by changing the gas flow rates. Such lenses may have 
possible uses in optical transmission systems. 

The assistance of W. E. Whitacre in constructing and setting up the 
equipment is gratefully acknowledged. 
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A Circle Diagram for Optical Resonators 


By J. P. GORDON 
(Manuscript received June 16, 1964) 


A graphical representation of the relationships between the param- 
eters of Hermite Gaussian light beams has been introduced recently’” 
by S. A. Collins, Jr. T. Li has pointed out’ that Collins’ chart has two 
equivalent forms. Collins’ chart relates the spot radius and phase front 
curvature at any position on the beam to the position and spot radius 
of the beam waist. In this note we point out that a similar chart can be 
made which relates the curvature parameters’ of any two phase fronts 
along a Gaussian beam and the spot radii on those phase fronts. The 
curvature parameters are defined as g, = 1 — d/Ri, go = 1 — d/R,, 
where R; and R, are the radii of curvature of the phase fronts, and d 
is their separation. This new chart directly relates mirror curvatures 
and spot radii in a spherical mirror resonator. 

The equations on which the chart is based are’’® 


2 
gi We 
oie ee 1 
oe (1) 
dd = 
WW, = mi (1 = 9192) (2) 


where w; and w, are the spot radii on the two phase fronts. If we elimi- 
nate w, and ge, respectively, from these two equations we get 


o-aye(4y-@) 


( Md _ mui bes 3) a 
mor 2add ”  \and ) 

On a graph whose Cartesian coordinates are \d/rw; and g,, these two 
equations represent circles of diameters 1/g. and (w,'/Ad), respec- 
tively, as shown in Fig. 1. The point of intersection of the two circles 
gives the values of w, and g; which satisfy both (8) and (4). A more 
complete chart, similar to Collins’ chart, can be used to read off spot 
radii in a resonator whose mirror curvatures are known, or to find mirror 
curvatures from measurements of the beam spots. As does Collins’ 
chart, this new chart has two equivalent forms, which in our case differ 
only in having the subscripts 1 and 2 interchanged. 


and 
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Fig. 1 — Circle diagram for spherical mirror resonators. Cartesian coordinates 
of the important points are indicated, along with some geometrical relationships. 
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Gas Pumping in Continuously Operated Ion Lasers 


By E. I. GORDON and E. F. LABUDA 
(Manuscript received June 26, 1964) 


Gas ion lasers! operate at discharge currents of several amperes in 
small-bore tubing. Under these conditions the discharge acts to pump 
gas from the cathode to the anode? and pressure differences in excess of 
10:1 can be established in less than one minute of discharge operation. 
Since the optimum pressure range for laser operation is narrowly defined 
relative to the range of pressures existing in the discharge tube (see Fig. 
1), laser action usually deteriorates or goes out shortly after turn-on. 
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Fig. 1— Laser output vs pressure at various discharge currents. 


When the discharge is turned off for several seconds to allow the pressure 
to equalize and then turned on again, the output returns to its initial 
value only to deteriorate again. 

By placing a connecting tube of high gas flow conductance between 
the anode and cathode as shown in Fig. 2, the pressure difference between 
the anode and cathode can be virtually eliminated. Tubes operated with 
the connecting tube show no deterioration over long periods of time. 
The tube length and bore are chosen so that the sustaining voltages for 
the two paths are comparable. 

The connecting tube is fashioned in the form of a helix to relieve any 
strains that might develop from differential expansion and because a 
helix provides a convenient means of getting a long length of tubing 
into a small volume. 

The connecting tube also serves to eliminate gas separation or cata- 
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Fig. 2 — Photograph of laser illustrating the connecting tube. 


phoresis that occurs when using mixed gases. This is of particular im- 
portance in dc-excited laser discharges such as helium-xenon.? Cata- 
phoresis is no special problem in the helium-neon discharges in the current 
range at which they are operated, and the side tube is unnecessary. 

We are indebted to J. T. Bannon, who constructed the experimental 
tubes. 
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